Research Preparation of Graduate Students 


The research preparation component of graduate programs in education has been a 
source of frustration and apprehension for generations of students. There are at 
least two reasons for the persistence of this phenomenon. One relates to the charac- 
teristics of the students; the other to the way the training institution treats them. 
In combination they generate a kind of self-fulfilling prophecy. 


Most students begin graduate studies having had little experience in, or much 
regard for, research activity. Their undergraduate education was designed to equip 
them to begin teaching by developing a knowledge base and a set of skills that 
would aid survival in the classroom. They then engaged in professional practice for 
a few years and were conditioned to rely on the folk wisdom of their colleagues as a 
guide to action. Throughout their professional socialization the actual and potential 
contributions of research to educational practice were generally unrecognized, if 
not derided. 

Usually, graduate studies is only a brief interlude or a part-time preoccupation. 
Enrolment occurs at a point when it is convenient or opportune in regard to their 
employment status and consonant with their personal life-style. There are often se- 
vere constraints on the time, energy, and money the students have to commit to 
the endeavor. What they perceive as placing the greatest strain on these resources 
is the thesis as well as the preparation associated with it. 


The novices’ skepticism and apprehension are reinforced soon after arriving on 
campus. Other students, sometimes to ease their own anxieties, regale them with 
horror stories, particularly about the course work in statistics. They encounter per- 
sons who are lingering on, long after most of their cohort has departed, because of 
their inability to get the thesis done. Staff members tell the newcomers that it is 
performance in the research courses that identifies the better students. 


What staff do not tell students about these apparently crucial courses is that 
their content is largely dependent upon who teaches them. Few university depart- 
ments have monitoring processes in place which enable them to know what is being 
taught and learned. Fewer still have a clear formulation of the scope and sequence 
of their research courses. Even fewer have policies which provide for the periodic 
renewal and rejuvenation of such courses. The result is a game of research roulette 
for graduate students which the vast majority will win by getting passing grades. 
However, what many students lose is the opportunity to gain an understanding of 
the various research strategies in the field of study and to develop the ability to 
plan and conduct research of significance. 

Admittedly, in a field of study as diverse as education and involving a creative 
activity like research, it is difficult to generalize and dangerous to prescribe. Yet 
surely, somewhere, there is someone who can develop a rationale and devise a re- 
search preparation component for graduate students which will imbue them with 
the spirit of discovery, allay their anxieties, and endow them with the capacity to 
advance knowledge. Perhaps that someone is you. Why not put your ideas in an 
article for the perspectives section of this journal? 

W.H.W. 
Individually Administered 
Intelligence Test Scores: 
Equivalent or Comparable? 


The validity of equating the Slosson Intelligence Test, the Peabody Picture 
Vocabulary Test, and the Standard Progressive Matrices to the WISC-R was assessed 
in terms of psychological and statistical criteria. All four tests were administered to 
common, deeply stratified random samples of 7½-, 9½-, and 11½-year-old students. 
Pairs of tests judged to be psychologically equivalent and whose disattenuated corre- 
lation coefficient exceeded .70 were equated using linear and hand and computerized 
equipercentile methods. The different methods were equally sensitive to the relaxed 
definition of parallelism; the mean conditional-root-mean-square errors of equating 
equaled approximately 11 IQ points. It was, therefore, concluded that the equating 
proposed was not valid. Instead, comparable score distributions were realized for the 
age populations from which the samples were drawn. 


In 1922, E.L. Thorndike pointed out that the development of increasing numbers 
of intelligence tests necessitated a means for transmitting “a score obtained with 
one test into the score that is equivalent to it in some other test” (Thorndike, 1922, 
p. 29). More recently, Lennon (1966) claimed that “it is generally agreed that there 
is need for data that will permit conversion of scores or IQs derived from one test 
to [equivalent] measures derived from other tests” (p. 198). The rationale for the 
call for equivalent score conversions is based on the desirability of establishing an 
empirical means for making comparisons among individuals or groups using 
common-purpose test scores from different test sources. 


Dr. Rogers is a member of the Department of Educational Psychology whose research interests are in the area of measurement 
and evaluation, and research methodology. 


Dr. Holmes is a School Psychologist whose research interests are in the assessment methodology applied to school settings. 




It has been suggested that one fallacy in the interpretation of intelligence tests 
is that comparisons of this nature are often made on what may be an erroneous as- 
sumption of direct score correspondence between tests which are seen as psycholog- 
ically equivalent. The basic misunderstanding underlying this practice has been 
termed the “jingle fallacy”: Tests which bear the same name measure the same 
thing. This has fostered the notion that there is a common and therefore constant 
conceptual meaning in the numerical representation of IQ (Anastasi, 1976; Ebel, 
1972, 1976; Newland, 1973). Despite the fact that it has been repeatedly pointed 
out that direct score equivalence does not exist from test to test (Anastasi, 1976; 
Cronbach, 1970; Salvia & Ysseldyke, 1978; Sattler, 1974), there can be little doubt 
from a practitioner’s point of view that IQ scores from various tests of intelligence 
are used interchangeably for educational purposes (Barratt, 1956; Birkemeyer, 
1964; Ritter, Duffey, & Fischman, 1973). 


Functional equivalence alone, however, is based on simple face validity of tests 
having a common name and claim of purpose. Such evidence provides inadequate 
justification for treating tests as equivalent or interchangeable. Psychological and 
statistical criteria need to be added. The present study was directed toward this 
issue. Specifically, the question examined was whether it was feasible to establish 
the type of score conversions advocated by Thorndike (1922) and Lennon (1966) 
for individually administered intelligence tests. Four tests regularly used in schools 
were considered: the Wechsler Intelligence Scale for Children—Revised (WISC-R, 
Wechsler, 1974), the Slosson Intelligence Test (SIT, Slosson, 1975), the Peabody 
Picture Vocabulary Test (PPVT, Dunn, 1965) and the Standard Progressive 
Matrices (SPM, Raven, 1960). 


Equivalent and Comparable Scores 

Pertinent to the resolution of the issue of equivalent scores is the distinction made 
between equivalent score conversions and comparable score conversions. Although 
more readily interpreted, the development of equivalent scores is somewhat more 
problematic than the development of comparable scores. Thus, in light of the con- 
troversy surrounding the interchangeability of the intelligence tests, both equating 
and comparing were examined in terms of their technical requirements and their 
applicability in the present study. 


Equivalent Scores 
The technical feasibility of equating is determined by the parallelism of the tests 
involved. According to Angoff, 
equating, or the derivation of equivalent scores, concerns itself with the problem of 
unique conversions which may be derived only across test forms that are parallel, that is 
forms that measure, within acceptable limits, the same psychological function. (1971, 
p. 563) 
Unfortunately, Angoff did not clarify the meaning of “within acceptable limits.” 
Ideally, parallelism is defined in the restricted mathematical sense of classical true 
score theory (see Lord and Novick, 1968, p. 48). Gulliksen (1950) outlined two cri- 
teria for parallel tests, one statistical and the other psychological: 
In addition to equal means, variances, and reliabilities, parallel tests should have approx- 
imately equal validities for any criterion . . . The tests should contain items dealing with 
the same subject matter, items of the same format, etc. In other words, the tests should be 
parallel as far as psychological judgment is concerned. (pp. 173-174) 
Wesman (1958) pointed out that “these conditions are ordinarily most closely 
approximated where parallel forms of a test have been constructed—forms which 
are intended to be interchangeable” (p. 8). 

For practical purposes, however, equivalent score conversions may be desirable 
among tests which were not specifically constructed in this manner, but which are 
used for the same purpose (Wesman, 1958; Flanagan, 1964; Lennon, 1966; Marco, 
Petersen, & Stewart, 1979). Wesman (1958) called attention to the fact that 

Assumptions we can make with regard to content and reliability of parallel forms of one 
test are not readily acceptable when we are dealing with two somewhat different tests of 
the same general ability. This situation is one in which the problem of equivalence fre- 
quently arises. (p. 8) 
Marco, Petersen, and Stewart described the strict psychometric requirements for 
equating as “ideal.” They suggested that these may be unrealistic in actual testing 
practice and that “scores must sometimes be equated under less-than-optimum 
conditions” (p. 1). The Anchor Test Study provides an example. According to 
Jaeger (1973) “the major question to be explored was the technical feasibility of 
equating tests that were not designed to be parallel” (p. 3). In that study, the psy- 
chological criteria were evaluated by a panel of experts. They judged the tests to 
require many of the same skills and abilities, but with less than perfect 
overlap. The statistical criterion was interpreted solely in terms of intertest correla- 
tion coefficients. When corrected for attenuation, they were found to be uniformly 
high. “Of the 207 corrected intercorrelations obtained in the Anchor Test Study 
data... only 20 might be considered significantly below .95; all 20 were above .89” 
(Loret, Seder, Bianchini, & Vale, 1974, p. 8). The value of .95 is often taken as an 
arbitrary cutoff for the purposes of equating (Linn, 1975, p. 206). 
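
The correction for attenuation referred to above can be illustrated with a minimal sketch. It assumes the classical formula, in which the observed correlation is divided by the square root of the product of the two reliabilities. The function name and the numerical values are hypothetical and are not taken from the Anchor Test Study.

```python
import math


def disattenuate(r_xy: float, r_xx: float, r_yy: float) -> float:
    """Correct an observed intertest correlation for unreliability in both tests.

    r_xy is the observed correlation; r_xx and r_yy are the reliabilities.
    """
    if not (0.0 < r_xx <= 1.0 and 0.0 < r_yy <= 1.0):
        raise ValueError("reliabilities must lie in the interval (0, 1]")
    return r_xy / math.sqrt(r_xx * r_yy)


# Hypothetical values chosen only for illustration.
corrected = disattenuate(0.72, r_xx=0.90, r_yy=0.80)
print(round(corrected, 3))  # 0.849
```

Under a cutoff such as .95, a pair with a corrected coefficient of this size would not qualify for equating.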


Lord and Novick’s (1968) notion of nominally parallel tests formalizes the re- 
laxation of the statistical restrictions imposed on strictly parallel tests. They 
suggested that the term “does not imply any mathematically defined degree of par- 
allelism in any particular respect” (p. 174). Nominally parallel tests are, rather, de- 
fined in terms of functional and psychological criteria. They are considered a 
“family” of different measurements, each of which measures “important aspects or 
manifestations of the psychological variable under study” (p. 174). In an earlier 
paper, Lord (1964) defined as nominally parallel “any set of test forms that are 
used interchangeably in common practice, even though they are not strictly parallel 
statistically” (p. 335). 


Comparable Scores 

Unlike the concept of equivalence, which is specific to the tests themselves and, as 
has been shown, is dependent upon parallelism, comparability refers solely to test 
score scales. Comparable scores can be achieved by scaling any tests in a common 
metric for a common reference group. The purpose of such common scaling is to 
assign the same numerical value to scores occupying the same rank in their respec- 
tive distributions. This can be done irrespective of test content and carries no im- 
plication of the interchangeability of tests. 
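
As a rough illustration of common scaling by rank, the sketch below matches percentile ranks within a single reference group. It is a simplified stand-in for equipercentile methods, which normally interpolate and smooth. The function names and the toy score lists are assumptions made for the example.

```python
from bisect import bisect_left, bisect_right


def percentile_rank(scores, x):
    """Midpoint percentile rank of x within the reference-group scores."""
    ordered = sorted(scores)
    below = bisect_left(ordered, x)          # scores strictly below x
    at = bisect_right(ordered, x) - below    # scores equal to x
    return 100.0 * (below + 0.5 * at) / len(ordered)


def comparable_score(x, scores_x, scores_y):
    """Return the observed test-Y score whose percentile rank is closest
    to the percentile rank of x on test X in the same reference group."""
    target = percentile_rank(scores_x, x)
    candidates = sorted(set(scores_y))
    return min(candidates, key=lambda y: abs(percentile_rank(scores_y, y) - target))


# Toy reference group: the same five examinees took both tests.
test_x = [90, 95, 100, 105, 110]
test_y = [20, 25, 30, 35, 40]
print(comparable_score(105, test_x, test_y))  # 35
```

The conversion says nothing about what either test measures. It only aligns rank positions for this group.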


Angoff (1971) described comparing as follows: “[Comparing] may be thought of 
quite simply as the problem of ‘equating’ tests of different psychological function 
(p. 590). [The derivation of comparable scores involves] the problem of nonunique 
conversions of scores across nonparallel forms” (p. 563). As opposed to test equat- 
ing, Angoff stressed the broader application of the concept of comparing (i.e., any 
two scales can be made comparable) and the narrower interpretation of the 
converted scores. Since the score conversions are nonunique, they are not 
generalizable beyond the group on which they were established (Millman & Lindlof, 
1964). Lord (1950) pointed out that “comparability may be expected to hold ap- 
proximately, however, with respect to groups sufficiently similar to the basic group. 
Obviously, the basic group should in general be a representative sample of persons 
of the type for whom the tests are intended” (p. 1). 


The technical considerations in the establishment of comparable scores, there- 
fore, are focused on the nature of the group used for establishing the conversions 
rather than on the nature of the tests themselves. 


Equating Versus Comparing 

The distinction between equating and comparing is blurred somewhat by the in- 
consistent use of these terms in the literature (see Lennon, 1966, for example) and 
the formalization of the notion of nominally parallel tests. A further note of clarifi- 
cation seems warranted. Equating and comparing are referred to as types of con- 
version, where a conversion is an empirical transformation of the score scale of one 
test so that it is related to the score scale of another test in some defined manner. 
When this transformation yields an equivalency relationship, the score values on 
the two tests have identical meaning in terms of the trait being measured. Thus 
equal score values represent a quantification of the same trait irrespective of the 
test used. When a comparable relationship is established between two tests, the 
same numerical score values correspond to equal standard scores and, in the case of 
similarly distributed scores, the same rank position within a specified group. This 
is a scale relationship only and carries no connotation regarding the equality of 
possession of the amount of the trait or traits being measured. 
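
The statement that a conversion makes the same score values correspond to equal standard scores can be sketched as a linear transformation. This is an illustrative sketch only: the function name and data are hypothetical, and population standard deviations are used as a simplifying assumption.

```python
from statistics import mean, pstdev


def linear_conversion(x, scores_x, scores_y):
    """Map a test-X score onto the test-Y scale so that the standard score
    (z score) in the reference group is the same on both scales."""
    z = (x - mean(scores_x)) / pstdev(scores_x)
    return mean(scores_y) + z * pstdev(scores_y)


# Toy reference-group data for two tests with different score scales.
test_x = [90, 100, 110]
test_y = [20, 30, 40]
print(round(linear_conversion(110, test_x, test_y), 2))  # 40.0
```

Whether the result is read as an equivalent score or merely a comparable score depends on the parallelism of the tests, not on the arithmetic.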


A clear example of the difference between the two, and one which exists within 
the context of this study, is as follows. The PPVT has two parallel forms: Form A 
and Form B. As such, it does not matter which form is used since they yield equiv- 
alent measures of receptive vocabulary. An individual would be expected to score 
similarly on both test forms. The WISC-R subtests, on the other hand, are compar- 
able tests. Although each subtest involves a different type of task, they are all 
scaled from 1-19 with mean 10 and standard deviation 3. Thus a scaled score of 12 
has a constant meaning in terms of rank across all subtests (that is, a percentile 
rank of 75). However, for any individual, the scores may be expected to vary con- 
siderably from one subtest to another. 
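
The percentile rank quoted for a scaled score of 12 can be checked with a short calculation. The sketch assumes that scaled scores are normally distributed with mean 10 and standard deviation 3. The function name is hypothetical.

```python
import math


def normal_percentile(score, mean=10.0, sd=3.0):
    """Percentile rank of a score under a normal distribution."""
    z = (score - mean) / sd
    return 50.0 * (1.0 + math.erf(z / math.sqrt(2.0)))


print(round(normal_percentile(12)))  # 75
```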

Comparability can be seen as the generic relationship, which accounts for much 
of the intermingling of the terms in the literature. A common definition of 
comparability is that the distributions of scores on two tests have been made iden- 
tical for a given group (Angoff, 1971; Flanagan, 1951; Lord, 1950). This applies to 
equivalent as well as to comparable tests (although the restricted reference group 
notion is not necessary for equivalent forms). Thus equivalent scores share the 
characteristics of comparable scores. Equivalence, however, is a specific case of 
comparability which is limited by the additional requirements of parallelism of the 
two tests. What is not yet clear is the extent to which the concept of equivalence 
extends to cover nominally parallel tests as described by Lord and Novick (1968, 
p. 174). 

The major question addressed in the present study concerned the validity of 
equating the SIT, PPVT, and SPM to the WISC-R. In this study, the WISC-R 
served as a reference test. The WISC-R, like its predecessor the WISC, shares 
status among individual intelligence tests with the Stanford-Binet as one of the 
“stately, revered, and venerated devices against which all other tests [are] meas- 
ured” (Bersoff, 1980, p. 113). Reviewers acclaimed the WISC-R as an “even more 
valuable tool than its highly praised and well-used predecessor” (Krichev, 1978, 
p. 335) and “the instrument of choice for the psychological assessment of children’s 
intelligence” (Whitworth, 1978, p. 351). However, it is time-consuming to adminis- 
ter and its use is restricted to persons who have undergone extensive training in ad- 
ministration and interpretation. The combination of its accepted validity and its 
obvious cost has, since its inception, precipitated “a continual search for a test of 
mental ability which would produce comparable results” (Covin, 1977, p. 259). The 
SIT, PPVT, and SPM have been considered major contenders for the role of short- 
test alternate. But is it valid to apply equating in this situation where, as suggested 
earlier, it could not be expected that the tests would strictly satisfy the classical 
criteria for parallelism? 


Methods, Results, and Discussion 
The validity of equating the SIT, PPVT, and SPM to the WISC-R was examined 
at three sequentially related stages. First, the criterion of psychological equivalence 
was assessed. Technical criteria were then applied to the test pairs identified as 
psychologically equivalent. Finally, linear and equipercentile equating methods 
were applied to the test pairs which satisfied both sets of criteria. 


Psychological Equivalence 
In the context of the present study, psychological equivalence was defined as simi- 
larity of the psychological trait or function measured. The hypothetical constructs 
underlying the tests insofar as they are reflected in the statements of their respec- 
tive authors and in the content and format of the tests themselves were compared. 
In addition, the results of correlational and factor analytic studies reported in the 
literature were examined for evidence of intertest similarities. 


Space limitations preclude presenting the vast array of information and data 
considered in this analysis. Instead, presented here is a summary of the findings 
(see Holmes, 1981, for detailed findings). 


Table 1 contains a brief description of the content and factorial structures of 
the four tests. As shown, three types of measures defined in terms of the verbal, 
nonverbal, and general IQ scales of the WISC-R were used. The tests judged to be 
psychologically similar to the reference scales are indicated in Table 2. The appear- 
ance of C, A, or V marks the basis for identification: C—content; A—author’s 
statements of purpose; and V—results from empirical validity studies. The coincid- 
ence of two or more of these similarity factors suggested strong evidence for the 
psychological equivalence of test pairs. This is visually represented by the solid line 
enclosures in the body of the table. Test pairs having weaker evidence of psycho- 
logical equivalence are illustrated with a broken-line enclosure. 


Functionally equivalent test pairs are also represented in Table 2 by the letter 
F. It is obvious from this illustration that evidence for the psychological equiva- 
lence of the test pairs considered in this study does not generally substantiate the 
functional equivalence which is implied in their common use. 
TABLE 1 


Content and Structure of the Tests 


                                   Type of Measure 

Test     Verbal                     Nonverbal                  General 

WISC-R   Yields Verbal IQ score,    Yields Performance IQ      Yields Full Scale IQ score, 
         supported by factor        score, supported by        which combines Verbal and 
         analytic studies           factor analytic studies    Performance. Constitutes a 
                                                               measure of g 
SIT      Verbal format. Similar                                Item diversity suggests a 
         to S-B, which has high                                more general measure 
         verbal factor 
PPVT     Strictly verbal 
SPM      Some suggestion of         Overtly nonverbal.         Considered a measure of 
         verbal components          Considered as such by      g by some 
                                    Raven 
TABLE 2 


Psychologically and Functionally Equivalent Test Pairs as Suggested by their Content 
(C), Authors’ Claim (A), Empirical Validity (V), and Practitioners’ Use (F) 


                       WISC-R 
Test     Verbal     Performance     Full Scale 
SIT      C,V                        A,C,V,F 
PPVT     A,C,V                      F 
SPM                 A,C,V           V,F 


Technical Equivalence 

Subjects 

The population identified for the second and third stages of the study consisted of 
all non-Native, English-speaking children at three age levels (7½, 9½, and 11½ 
years) attending public and independent schools in the province of British Columbia, 
Canada. The population was further restricted to exclude students enrolled in 
classes for the physically handicapped, emotionally disturbed, and the trainable 
mentally retarded. 

A two-stage stratified proportional sample design was used to select a probabil- 
ity sample at each age level. At stage one, the target sample size of 180 students 
was allocated proportionally to cells defined by geographic region, community size, 
and size of school. Since it was considered desirable to assess one student per 
school at each age level, the allocated sizes determined the number of schools to be 
selected from each cell. The schools within a cell were ranked by their estimated 
enrollment at age 7½ and divided into n substrata, where n is the sample allocation for the 
cell. To ensure representation of small schools (there were many rural schools with 
one to three students per age), similar size schools with enrollments less than 2n 
and within close geographic proximity were amalgamated and treated as one. 
Schools were then selected within substrata proportional to their estimated size at 
age 7½. Where possible, the same school was used for ages 9½ and 11½. In cases 
where the allocated sample size was larger at ages 9½ and 11½, or where the age 
7½ school contained only primary (K-3) grades, the sampling procedure was repeated 
at the other age levels for the particular cell in question. 
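
The proportional allocation at stage one can be sketched as follows. The article does not state how fractional allocations were rounded, so the largest-remainder rule used here is an assumption. The cell labels and enrollment figures are hypothetical.

```python
def proportional_allocation(cell_sizes, total):
    """Allocate `total` sampling units to cells in proportion to cell size.

    Fractions are resolved with largest-remainder rounding (an assumption;
    the source does not describe its rounding rule).
    """
    population = sum(cell_sizes.values())
    quotas = {cell: total * size / population for cell, size in cell_sizes.items()}
    allocation = {cell: int(quota) for cell, quota in quotas.items()}
    shortfall = total - sum(allocation.values())
    # Give the leftover units to the cells with the largest fractional parts.
    by_remainder = sorted(quotas, key=lambda c: quotas[c] - allocation[c], reverse=True)
    for cell in by_remainder[:shortfall]:
        allocation[cell] += 1
    return allocation


# Hypothetical cells defined by region, community size, and school size.
cells = {"cell_A": 500, "cell_B": 300, "cell_C": 200}
print(proportional_allocation(cells, 180))
# {'cell_A': 90, 'cell_B': 54, 'cell_C': 36}
```

With one student assessed per school at each age level, each allocated count would also be the number of schools to draw from that cell.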

At stage two, students were randomly chosen from the sampled schools. Ages 
were defined within three months of the midyear; thus 7½ years spanned a six-month 
range from seven years three months to seven years nine months. Ages 9½ 
and 11½ were similarly defined. Equal numbers of boys and girls were assigned to 
each cell when the sample size for that cell was even. When cell sample size was 
odd, the sex of the “odd” person was determined randomly and the female-male 
quota of the next adjacent cell (within region) was structured to balance the num- 
bers. 


Procedure 

Test materials. The SIT, PPVT, and SPM (Sets A, B, C, D, and E) were 
administered without change. On the WISC-R, six items in the Information subtest 
and one item in the Comprehension subtest were reworded to “Canadianize” 
United States-specific content. The rewordings adopted were recommended by 
Vernon (1974, 1976, 1977) on the basis that their use with Canadian children 
yielded subtest pass percentages similar to those for the American standardization 
sample. As well, only 10 WISC-R subtests were given: Mazes and Digit Span were 
not administered. 


Test administration. In all cases, the WISC-R was administered by “level C” 
testers (Cronbach, 1970, p. 18). Generally, the same person administered all tests to 
a given student. In some situations, the non-WISC-R tests were administered by a 
learning assistance teacher or a first-year graduate student in school psychology 
trained to “level B” (Cronbach, 1970, p. 18) testing competence. Two testing ses- 
sions were used. The WISC-R was administered in the first session followed by the 
remaining tests, in six counterbalanced orders, in the second. 
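
Three tests can be ordered in exactly six ways, which accounts for the six counterbalanced orders. The sketch below enumerates them. The rule for assigning an order to each student is not described in the article, so the simple cycling shown here is an assumption and `order_for` is a hypothetical helper.

```python
from itertools import permutations

TESTS = ("SIT", "PPVT", "SPM")

# All 3! = 6 possible administration orders for the second session.
orders = list(permutations(TESTS))


def order_for(student_index):
    """Cycle through the six orders so that each is used about equally often."""
    return orders[student_index % len(orders)]


for position, order in enumerate(orders, start=1):
    print(position, " -> ".join(order))
```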


Test scoring. To ensure consistency, all tests were scored following the direc- 
tions given in the respective manuals. Second party verification of a 10% random 
sample of tests revealed a .57% error rate for the WISC-R. For all other tests, the 
error rate was zero percent. The test scores were coded and keypunched with 100% 
verification. 


Demographic Description of Final Samples 

The actual number of students tested together with the number of usable tests 
were, respectively, 115 and 99 at age 7½, 117 and 108 at 9½, and 108 and 104 at 
11½. Part of the nonresponse was attributable to nonparticipation by 26 of the selected 
schools; the remainder was due to the failure to obtain parental permission 
which was necessary for test administration. WISC-R scores were obtained for each 
child tested. For a few students, however, scores were not available on the remain- 
ing three tests. For some of these children, testing was not completed because of 
absenteeism. For the remaining, test protocols were considered unscorable due to 
errors in administration. 


Table 3 contains descriptions of the final sample for each age group in terms of 
the sampling stratification variables and a socio-educational variable. As shown, 
the relative proportions of children tested in each geographic region corresponded 
closely to the population proportions. The Metro and Kootenay regions were slight- 
ly overrepresented while the Okanagan and Fraser Valley regions were somewhat 
underrepresented. There was a slight sample bias toward schools located in large 
urban rather than rural areas. This is in keeping with the inflated representation of 
the Metro region. Any trend toward the disproportionate inclusion of large units 
which might be suspected from the data to this point, however, is disproved by the 
distribution for school size. More of the children tested were enrolled in small 
schools than was expected from the target sample allocation based on these popula- 
tion percentages. This proportion is attributed to the testing of more children in 
small schools in the Metro region than anticipated. 


The percentages shown in Table 3 for level of education of head of household 
were derived for the entire province. Often used as a stratification variable, it was 
not possible to use this variable during sample selection. Thus classification accord- 
ing to level of education was requested from parents after the sample had been 
drawn. As shown, a much larger than expected percentage reported the highest 
level of schooling completed to be senior secondary (grades 11 to 13). 


In summary, although the obtained sample sizes were less than the target size 
on which the allocations were based, the samples were dispersed in basically the 
same relative proportions in the population for all stratification variables. 


Technical Suitability of Test Pairs 

The technical suitability of the test pairs identified as psychologically equivalent 
was examined in terms of the reliability of the tests and the correlation between 
tests. As well, the characteristics of the score distributions were examined to deter- 
mine the appropriate method of conversions to use. Since order of test administra- 
tion was nonsignificant (analyses of variance yielded results consistent with chance, 
p>.25), these initial analyses and the subsequent score conversions were based on 
the total sample for each test. 

Characteristics of score distributions. The means and standard deviations are 
reported in Table 4 for each test at each age level. One-sample statistical tests revealed 
that, in general, the means were significantly higher and the variances lower 
(p<.05) than the corresponding values published in the respective test manuals 
(Holmes, 1981). The Kolmogorov-Smirnov one-sample goodness of fit test (Siegel, 
1956) indicated that, for each test, the hypothesis of normally distributed scores 
was tenable (p>.20). These findings suggested that since the score distributions 
were essentially the same, the results of linear equating should closely approximate 
the results of equipercentile equating (Angoff, 1971, p. 564). 


Reliability 

Table 5 contains the internal consistency and disattenuated correlation coefficients 
for the test pairs identified as psychologically equivalent (Table 2). The internal 
consistency coefficients were computed following the procedures outlined in the 
WISC-R Manual (Wechsler, 1974, pp. 27-28). Split-half correlations corrected by 
the Spearman-Brown formula were used for all subtests with the exclusion of 


W.T. Rogers and B.J. Holmes 


TABLE 3 


Representativeness of the Samples 
Population and Sample Percentages 


                                                       Age 
Demographic               7½ (n=99)                 9½ (n=108)                11½ (n=104) 
Variable                  Pop.         Sam.         Pop.         Sam.         Pop.         Sam. 
Region 
  Okanagan                14.9         13.0         [illegible]  12.8         14.6         12.0 
  Metro                   38.4         [illegible]  38.2         40.2         35.9         38.9 
  Fraser Valley           [illegible]  7.8          11.0         8.6          11.6         9.5 
  Vancouver Island        [illegible]  16.5         [illegible]  18.8         17.6         17.6 
  Kootenay                5.9          7.8          5.9          7.8          [illegible]  8.3 
  Northern                [illegible]  13.0         [illegible]  12.0         [illegible]  13.9 
Community Size 
  <1000                   16.9         13.0         [illegible]  14.8         16.6         13.0 
  1001-50,000             42.0         41.7         42.1         40.2         44.1         47.2 
  >50,000                 [illegible]  45.2         40.9         45.3         39.4         39.8 
School Size 
  <150                    12.6         [illegible]  10.9         14.5         8.7          12.0 
  151-300                 29.2         27.8         29.1         26.5         27.4         26.8 
  >300                    58.2         [illegible]  60.0         60.0         63.9         [illegible] 
Gender 
  Male                    -a           [illegible]  -            [illegible]  -            52.8 
  Female                  -            47.8         -            [illegible]  -            47.2 
Education Level, 
Head of Household 
  <Grade 9                [illegible]  10.5 
  Grades 9 to 10          16.0         18.6 
  Grades 11 to 13         27.0         38.0 
  Post-Secondary, 
    non-university        20.0         15.9 
    university, college   23.6         17.0 


Note: The population percentages are based on 1976-1977 school enrollment data for region, 
community size, and on 1976 Census data for educational level of head of household. 
a Not available 


Combined across three age levels, n = 258. 
TABLE 4 


Means and Standard Deviations 


                                  Age 
Test              7½             9½             11½ 
WISC-R 
  Verbal          106.08         106.01         101.68 
                  13.79          12.46          12.69 
  Performance     109.92         108.86         107.68 
                  13.11          [illegible]    13.53 
  Full Scale      108.16         107.89         104.80 
                  12.35          12.76          12.88 
SIT               114.95         [illegible]    109.48 
                  11.90          12.68          15.42 
PPVT              106.53         [illegible]    110.49 
                  12.25          14.30          14.99 
SPM               [illegible]    31.58          37.58 
                  7.24           8.06           6.54 


Note: The mean is listed first, followed by the standard deviation. 


coding. Guilford’s (1954, p. 393) formula for computing the reliability of a compos- 
ite group of tests was used to obtain reliability coefficients for the composite scores. 
The obtained values, which ranged from .78 to .93, were considered adequate for 
equating (Marks & Lindsay, 1972, p. 55). Of further interest was the size of the dis- 
crepancy between coefficients of tests to be equated. Angoff (1971) pointed out 
that to be equivalent, tests must be equally reliable (p. 568). However, in practice 
this condition is rarely met. Unfortunately there are no specific guidelines regard- 
ing the tolerance of equating procedures from exact equality of reliabilities. Appli- 
cation of statistical procedures to test the equality of reliabilities of two tests 
administered to the same sample depends upon the correlation between the tests. 
The nature of the dependency is such that the higher the intercorrelation, the more 
likely the null hypothesis will be rejected (see Feldt, 1980). Consequently, slight 
differences between the reliabilities of highly correlated (parallel) tests are found to 
be significant. Application of these statistical procedures appears to be too 
stringent a test. Marks and Lindsay (1972) found, using Monte Carlo procedures, 
that differences up to .20 did not significantly affect the results of equating
nominally parallel tests. They pointed out, however, that this may be an artifact
of the equating procedure they used (p. 54). In the present study, the maximum
reliability difference is .14 (WISC-R Verbal: SIT, age 9½); the remaining differences are
.10 or less. Within the range investigated by Marks and Lindsay, these differences 
were considered insufficient to warrant deletion of any of the test pairs for equat- 
ing. 

Intertest correlation. The intertest correlation coefficients, corrected for
attenuation using the basic formula $r'_{xy} = r_{xy}/(r_{xx} r_{yy})^{1/2}$ and reported in Table 5, ranged from
.48 to .94. Other than general agreement that tests, to be equated, should correlate 
highly, there are no definitive guidelines for setting a minimum value. In the pres- 
ent study, a value of .70 was used to identify test pairs for equating. Somewhat 
TABLE 5

Internal Consistency and Disattenuated Correlation Coefficients
for Psychologically Equivalent Test Pairs

Test Pair                   Age      Internal          Disattenuated
                            Level    Consistency(a)    Correlation

WISC-R Verbal: SIT          7½       .86, .78          .94*
                            9½       .93, .79          [illegible]*
                            11½      .93, .88          [illegible]*
WISC-R Verbal: PPVT         7½       .86, .88          .68*
                            9½       [illegible]       [illegible]*
                            11½      .93, .87          [illegible]*
WISC-R Performance: SPM     7½       .88, .89          .50
                            9½       .83, .92          .60
                            11½      .82, .83          .68
WISC-R Full Scale: SIT      7½       .86, .78          [illegible]*
                            9½       .88, .79          .80*
                            11½      .87, .88          .89*
WISC-R Full Scale: SPM      7½       .86, .89          .48
                            9½       .88, .92          .58
                            11½      .87, .83          .67

(a) WISC-R listed first
* Selected for statistical equating


lower than the values observed in the Anchor Test Study, correlations of this mag- 
nitude are commonly found between WISC-R scores and scores on other tests of 
intelligence (Sattler, 1974; Wechsler, 1974). Eight of the test pairs shown in Table 5 
were selected using this value: WISC-R Verbal: SIT, all ages; WISC-R Full Scale:
SIT, all ages; and WISC-R Verbal: PPVT, ages 9½ and 11½. A ninth pair,
WISC-R Verbal: PPVT, age 7½, was added, thereby completing the set for the
WISC-R Verbal: PPVT. The correlational evidence was considered insufficient to 
warrant keeping any of the test pairs involving the SPM, even though they had 
been judged to be psychologically equivalent. 
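
The correction for attenuation and the .70 screening value described above can be illustrated with a minimal sketch. The observed correlation used here is hypothetical (the source reports only the corrected coefficients); the two reliabilities are the age 7½ WISC-R Verbal and PPVT internal consistency values from Table 5. The function name is an assumption made for illustration.

```python
import math

def disattenuate(r_xy, r_xx, r_yy):
    """Correct an observed correlation for unreliability in both tests."""
    return r_xy / math.sqrt(r_xx * r_yy)

# Hypothetical observed correlation; reliabilities taken from Table 5.
observed_r = 0.59
corrected = disattenuate(observed_r, r_xx=0.86, r_yy=0.88)

# Screening rule used in the study: corrected correlation of at least .70.
meets_cutoff = corrected >= 0.70
print(round(corrected, 2), meets_cutoff)
```

Under these assumed inputs the corrected coefficient falls just short of .70, which mirrors the situation of the ninth pair that was added to complete the WISC-R Verbal: PPVT set.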


Equating 

Procedure 

Both linear and equipercentile methods were applied to the nine test pairs “eli- 
gible” for statistical equating. Both methods were considered since, although it 
could be assumed that both would yield nearly equal results given the similarity of 
shape of the test score distributions, it was not clear which, if either, of the 
methods would yield acceptable results in light of the relaxed technical criteria 
used. 


WISC-R scores, linearly equivalent to an observed score on the SIT or PPVT, 
were computed using Lord’s (1950) linear transformation form for equal standard 
scores: 


$$Y^*_{wi} = \frac{s_w}{s_t}\,(y_{ti} - \bar{y}_t) + \bar{x}_w \qquad \text{(p. 5)}$$

where $Y^*_{wi}$ is the equivalent WISC-R score on scale w (w = Verbal, Full Scale) for
person i with raw score $y_{ti}$ on test t (t = SIT, PPVT), and $\bar{x}_w$, $\bar{y}_t$, $s_w$, and $s_t$ are the
corresponding means and standard deviations of w and t. Equipercentile equating
involved the identification of corresponding percentiles in the w and t distributions
and the assignment of equivalence to their associated score values. Two procedures, 
handgraphing (Angoff, 1971, pp. 571-575; Flanagan, 1951, pp. 727-730) and 
computerized or analytical (Lindsay & Prichard, 1971), were used. The accuracy of 
each method was assessed in terms of the conditional root-mean-square error of 
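equating:

$$\mathrm{RMSE}_w = \sqrt{\frac{1}{n}\sum_{i=1}^{n}\left(Y^*_{wi} - x_{wi}\right)^2}$$

where n is the sample size, $Y^*_{wi}$ is the converted score, and $x_{wi}$ is the observed score for person i on WISC-R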
scale w (cf Bianchini & Loret, 1974a, p. 160). Not to be confused with the standard 
error of equating (Angoff, 1971, p. 570), this error measure reflects the degree to 
which a WISC-R score read from an equating table produced for the SIT or PPVT 
differed, on average, from the actual WISC-R score obtained by an individual. 
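
As a rough illustration of the two quantities just described, the sketch below applies the equal-standard-score linear conversion and then computes the root-mean-square difference between converted and observed scores. The paired scores are hypothetical, the function names are assumptions, and the use of the sample (n - 1) standard deviation is a choice made here; the ratio of standard deviations is the same under either definition.

```python
import math
from statistics import mean, stdev

def linear_equate(y, t_scores, w_scores):
    """Convert score y on test t to the w scale by matching standard scores."""
    slope = stdev(w_scores) / stdev(t_scores)
    return slope * (y - mean(t_scores)) + mean(w_scores)

def rmse_of_equating(t_scores, w_scores):
    """Root-mean-square difference between converted and observed w scores."""
    converted = [linear_equate(y, t_scores, w_scores) for y in t_scores]
    n = len(w_scores)
    squared = sum((c - x) ** 2 for c, x in zip(converted, w_scores))
    return math.sqrt(squared / n)

# Hypothetical paired scores for five examinees (t = PPVT, w = WISC-R Verbal).
t = [90, 95, 100, 105, 110]
w = [88, 99, 97, 108, 108]

print(linear_equate(100, t, w))   # the mean of t maps to the mean of w
print(rmse_of_equating(t, w))
```

The error is taken over the same examinees who supplied the equating data, which matches the description of comparing a score read from the equating table with the score each individual actually obtained.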


Results 

The results yielded by the different equating procedures were similar both within 
and across the nine test pairs that were equated. In light of this agreement, the 
presentation of results is restricted to the detailed results for one test pair and a 
summary of the conditional root-mean-square errors. 


Table 6 consists of a series of “equivalent” score values for the PPVT and 
WISC-R Verbal IQ at age 7½ which were produced by the different equating
procedures. The scores used for equating were the renormed values for the British
Columbia population (see Holmes, 1981, pp. 103-108). Thus the mean IQ reported for
the PPVT in Table 6 is 100.00 rather than 106.53 as listed in Table 4. It will be 
noted that the converted IQ values are reported to two decimal places. Since the 
purpose of this part of the study was to examine the efficacy of equating proce- 
dures, it was desirable to retain precision to the second decimal point. In the prep- 
aration of equivalency tables for practitioners’ use, however, these would be 
rounded to the nearest whole number (cf Loret et al., 1974). 

In the first column the raw score values for the PPVT are listed. The obtained 
score range on this test was 70-137. The next column contains the results of linear 
equating. The results of the equipercentile equating procedures are shown in col- 
umns 3 to 6. The converted scores from the hand-graphing procedure are listed 
first (column 3), followed by the results from the computerized analytical program 
(columns 4-6). Column 4 shows the interpolated scores, and columns 5 and 6 the 
converted scores resulting from the linear and polynomial curve-fitting procedures 
(Lindsay & Prichard, 1971). Below the table, the conditional root-mean-square 
errors of equating are presented to aid in the interpretation of the results. The 
subscripts L, G, I, LC, and PC indicate the correspondence of these errors to the 
linear score, graphic, interpolated, linear curve-fitting, 2nd polynomial curve-fitting 
columns respectively. 

Linear equating. As shown in Table 6, column 2, the converted values for the 
linear score procedure ranged from 71.24 to 135.65. The actual obtained score range 
TABLE 6

Comparative Example of Linear Equating and Equipercentile Equating
for Converting PPVT IQs to WISC-R Verbal IQs
(Age 7½)

$r'_{xy} = .68$, $r_{xx} = .86$, $r_{yy} = .88$

                                   Equipercentile Equating
                                                 Analytical
                                                          Curve-Fitting
PPVT IQ   Linear        Graphic       Interpolated   Linear        Polynomial
                                      Scores

 70        71.24         66.2          70.00         [illegible]   [illegible]
 75        76.04         73.0         [illegible]     76.25         69.43
 77       [illegible]    74.6          72.67          77.88        [illegible]
 79        78.89         76.0          76.00          79.51         74.24
 80        80.85         76.8          76.50          81.14        [illegible]
 82        82.77        [illegible]    80.25          82.77         80.45
 84        84.70         81.5          81.00          84.41         83.56
 86        86.62         85.4          85.50          86.04         86.50
 87        87.58         87.5          88.00          87.67         89.19
 89        89.50         91.0          93.67          89.30         91.62
 91        91.42         93.7          95.20          90.93         93.77
 93        93.35         96.4          96.60          92.56         95.66
 94        94.31         97.6          99.00          94.20         97.34
 96        96.23         99.7          99.67          95.82         98.83
 98        98.16        101.4          99.83          97.46        100.18
100       100.08        102.8         101.29          99.08        101.45
101       101.04        103.6         101.57         100.72        102.68
103       102.96        104.8         102.43         102.35        103.92
105       104.88        106.0         104.67         103.98        105.18
107       106.81        [illegible]   107.14         105.61        106.52
108       [illegible]   108.0         107.86         107.24        107.92
110       109.69        109.4         109.50         108.88        109.42
112       [illegible]   111.0         [illegible]    110.51        110.99
114       113.54        112.6         [illegible]    [illegible]   [illegible]
115       114.50        113.4         113.00         [illegible]   114.26
117       116.42        115.0         115.00         115.40        115.91
119       118.34        116.7         119.00         [illegible]   [illegible]
121       120.27        118.4         120.50         118.67        119.07
124       [illegible]   121.0         122.00         121.93        [illegible]
128       127.00        124.7         123.50         125.18        123.90
130       128.92        126.6         124.00         126.82        [illegible]
133       131.80        129.7         126.33         130.08        126.35
135       [illegible]   [illegible]   [illegible]    [illegible]   127.32
137       135.65        133.8         129.00         133.34        128.65

Conditional root-mean-square error of equating
           13.08         13.18         12.94          12.89         12.90
was 70 to 129. Taken across all the scores, the conditional root-mean-square error
was 13.08. The extent of the discrepancies between observed and converted scores
is shown more fully in Figure 1. This diagram shows a scatter plot of the obtained
scores with the linear conversion line, represented by the long dashed line,
superimposed on it. Discrepancies between an observed WISC-R Verbal IQ and the
corresponding equated value for a given PPVT raw score are represented by the
length of the vertical segment between an observed score point (represented by a
closed circle) and the conversion line (e.g., see vertical segment A).


[Figure 1. Scatter plot of obtained WISC-R Verbal IQs against PPVT scores, with the linear, interpolated scores, linear curve-fitting, and polynomial curve-fitting conversion lines superimposed.]
Graphic equipercentile equating. The converted scores listed in the column of
Table 6 labeled “Graphic” were derived following the steps outlined by Angoff 
(1971) and Flanagan (1951). As shown, they ranged from 66.2 to 133.8. The condi- 
tional root-mean-square error of equating was 13.18. 

Analytical equipercentile equating. As described by Lindsay and Prichard 
(1971) the first step of the analytical procedure was to produce interpolated 
distributions for both PPVT raw scores and WISC-R Verbal scores. The converted 
values reported in Table 6 correspond only to the obtained PPVT scores; the con- 
verted values corresponding to the obtained WISC-R scores are not shown. The 
second step of the procedure, curve-fitting, represented an attempt to fit a regres- 
sion line to the interpolated score distributions using both a linear and a polyno- 
mial functional equation. The bivariate interpolated score distribution and the 
linear and polynomial regression lines are shown in Figure 1. The conditional root- 
mean-square errors corresponding to each of these were 12.94, 12.89, and 12.90 re- 
spectively. 
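
The equipercentile idea, assigning equivalence to scores that hold the same percentile rank in the two distributions, can be sketched as follows. This is a simplified illustration under stated assumptions, not the Lindsay and Prichard (1971) program: percentile ranks use a midpoint rule for ties, conversion uses straight-line interpolation between observed score points, and no curve-fitting step is attempted. The data and function names are hypothetical.

```python
def percentile_rank(score, scores):
    """Percent of scores below `score`, counting ties as half."""
    below = sum(s < score for s in scores)
    equal = sum(s == score for s in scores)
    return 100.0 * (below + 0.5 * equal) / len(scores)

def score_at_percentile(p, scores):
    """Interpolate the score whose percentile rank equals p."""
    ordered = sorted(set(scores))
    ranks = [percentile_rank(s, scores) for s in ordered]
    if p <= ranks[0]:
        return float(ordered[0])
    if p >= ranks[-1]:
        return float(ordered[-1])
    points = list(zip(ranks, ordered))
    for (r0, s0), (r1, s1) in zip(points, points[1:]):
        if r0 <= p <= r1:
            return s0 + (s1 - s0) * (p - r0) / (r1 - r0)

def equipercentile_equate(y, t_scores, w_scores):
    """Score on w holding the same percentile rank that y holds on t."""
    return score_at_percentile(percentile_rank(y, t_scores), w_scores)

# Hypothetical score distributions for two tests.
t = [90, 95, 100, 105, 110]
w = [88, 97, 99, 108, 108]
print(equipercentile_equate(100, t, w))
```

Scores beyond the lowest or highest observed percentile rank are clamped to the end points here, which is one reason the extremes of a conversion table built this way deserve the caution noted in the text.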


Summary 
The characteristics of the equating methods and procedures which were demon- 
strated in the example are summarized below. 


Graphic versus interpolated equipercentile equating. The analytical procedure 
was included in the present study for comparison with the traditional hand- 
graphing procedure. This comparison involves only columns 3 and 4 in Table 6 (i.e., 
those columns labeled “graphic” and “interpolated scores”). As can be seen, these 
two score distributions are very similar. Again, differences are most evident in the 
extremes of the distributions where data for graphing are scant. Despite these dif- 
ferences, however, examination of the conditional root-mean-square errors of 
equating reveals that the two procedures are comparable. 


The curve-fitting procedures. Curve-fitting, through the application of linear 
and polynomial regression procedures, was included by Lindsay and Prichard 
(1971) as a further effort to define the shape of the conversion line. As can be seen 
by comparing columns 4, 5, and 6, the distributions of predicted scores are similar 
to the distribution of interpolated scores. This is reflected in the size of the respec- 
tive conditional root-mean-square errors. 


Linear versus equipercentile equating. The linear and equipercentile conver- 
sion procedures produced very similar results in the central areas of the 
distributions. Differences in converted score values are most pronounced in the ex- 
tremes. However, the conditional root-mean-square errors are of similar magnitude 
for both linear and equipercentile procedures. 


The linear and analytical equipercentile methods were applied to the remaining 
eight test pairs to be equated to examine possible differences in performance at- 
tributable to the different technical characteristics of each test pair (Table 5). Be- 
cause of the close relation between the graphic and interpolated procedures and the 
tedium associated with completing the former, there was no advantage in produc- 
ing the graphic converted scores as well. The results substantiated those illustrated 
above for the WISC-R Verbal: PPVT, age 7½. As shown by the conditional
root-mean-square errors provided in Table 7, there was very little difference between
the linear and equipercentile procedures. Consequently, there is no preference for 
one method over the other in terms of the accuracy of the converted scores in rela- 
tion to the obtained scores. 
TABLE 7
Conditional Root-Mean-Square Error for Equating

                                                   Equipercentile
                                                                Curve Fitting
Test Pair                  Age
                           Level   Linear        Interpolated   Linear        Polynomial

WISC-R Verbal: SIT         7½      10.19         10.28          10.34         12.18
                           9½      12.53         12.11          12.46         14.08
                           11½      8.78          9.00           8.72          9.26
WISC-R Verbal: PPVT        7½      13.08         12.94          12.89         12.90
                           9½      12.65         [illegible]    12.65         12.21
                           11½     11.43         10.92          11.24         [illegible]
WISC-R Full Scale: SIT     7½      12.63         12.58          12.84         12.14
                           9½      12.14         [illegible]    [illegible]   12.83
                           11½     [illegible]   [illegible]     9.85         10.62


The size of the conditional root-mean-square errors can be judged both in com- 
parison to those found for the Anchor Test Study and in a practical sense as they 
pertain to test interpretation. In the Anchor Test Study, the conditional error of 
equating was computed for vocabulary, comprehension, and total reading for two 
distinct randomly selected test pairs at each of the three grade levels considered. 
The results were similar for both linear and equipercentile procedures. For 
equipercentile equating, the average conditional root-mean-square ranged from 
3.31 to 10.04; 15 of the 18 values were less than eight (Bianchini and Loret, 1974a, 
pp. 158-160; 1974b, Tables 2170-2187). In the present study, the results were again 
similar across the different equating procedures. However, the conditional root- 
mean-squares were higher and less variable, ranging from 8.78 to 14.08 (see 
Table 7). Using the obtained WISC-R scores as the criterion against which the
equivalent scores were compared, discrepancies of ±11-12 points were frequently
observed. In reference to score interpretation, this suggests that a discrepancy of as
much as ±12 score points would be expected between an observed and converted
WISC-R score value. Looked at another way, the average discrepancy exceeded the 
standard error of measurement (approximately five points) in most cases by a fac- 
tor of two. Deviations of this magnitude are intolerable for scores on which educa- 
tional decisions are based. Therefore, the magnitude of these errors was interpreted 
as evidence of the nonequatability of the nine test pairs which had been judged as 
psychologically equivalent and which met the technical criteria, relaxed from those 
associated with parallelism, used in this study. 
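
The comparison with the standard error of measurement can be made concrete with a short calculation. The sketch below uses the conventional formula, SEM = SD x sqrt(1 - reliability). The standard deviation of 15 and the reliability of .89 are assumptions chosen for illustration (the text states only that the SEM is approximately five points), and the 12-point discrepancy is the figure cited above.

```python
import math

def standard_error_of_measurement(sd, reliability):
    """Classical SEM: score SD times the square root of (1 - reliability)."""
    return sd * math.sqrt(1.0 - reliability)

# Assumed values: IQ-scale SD of 15 and a reliability of .89.
sem = standard_error_of_measurement(sd=15.0, reliability=0.89)

# Discrepancy of 12 points (from the text) expressed in SEM units.
ratio = 12.0 / sem
print(round(sem, 2), round(ratio, 2))
```

With these inputs the SEM comes out close to five points and the equating discrepancy is a little more than twice that size, consistent with the factor of two reported in the text.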


Conclusions 
The magnitude of the conditional root-mean-square errors of equating associated 
with both linear and equipercentile equating methods was interpreted as an 
indication of the nonequatability of the pairs of intelligence tests selected for 
equating in this study. In light of this finding, the criteria established for the 
equivalence of the tests are reassessed. Following that, the implications of the re- 
sults for testing practice are examined. 
The Equatability of Nominally Parallel Test Pairs

The justification for the equating phase of this study was established using the 
concepts of functional, psychological, and statistical equivalence. The functional, or 
practical, equivalence of the four intelligence tests used generated the research 
question regarding the feasibility of an equating study. Functional equivalence 
alone, however, is based on the simple face validity of tests having a common name 
and claim, and was deemed an inadequate justification for the consideration of par- 
allelism. 

The determination of psychological equivalence of test pairs was based on an 
analysis of the similarity of the psychological function measured by each test. Thus 
tests labeled as verbal measures of intelligence and having obvious verbal content 
(e.g., WISC-R Verbal and PPVT) were considered psychologically parallel. These 
designations were further supported by the results of previous empirical studies 
showing high correlations and/or factor loadings. The notion of content similarity 
was, however, defined in a broad sense rather than in terms of the balance of items 
measuring similar skill categories or domains across tests as in the Anchor Test 
Study (Jaeger, 1973). Thus, for example, a global notion of verbal measurement 
may be inadequate to account for differences in the unidimensionality of the verbal 
task required for the PPVT and the multidimensionality of the verbal tasks re- 
quired for the five WISC-R Verbal subtests. Although Lord and Novick (1968) pro- 
vided no guidelines for the degree of test similarity required for nominal 
parallelism, Marks and Lindsay (1972) stressed the notion of slight differences only 
among tests considered as nominally parallel. 


The lack of specificity regarding the size of correlations delimiting nominal
parallelism was similarly noted. Again, although Lord and Novick (1968) provided
no statistical criterion of similarity, Marks and Lindsay (1972) referred to high
correlations, while others suggested a cutoff value of .95 (Loret et al., 1974, p. 8;
Linn, 1975, p. 206). However, for the range of disattenuated correlations observed
(.68 to .94), there were no appreciable differences in the sizes of the conditional
root-mean-square errors of equating. Thus it was concluded that although high
correlation appears to be necessary for equating, high correlation alone is not
sufficient. As suggested above, content similarity based on item and domain
correspondence is also required. 


The Use and Interpretation of the Converted Score Scales 

From a practical point of view, the results of the study confirm the nonequivalence, 
that is, the noninterchangeability of the four individually administered intelligence 
tests examined. However, the converted IQ scales for the WISC-R, PPVT, and SIT 
are comparable; that is, given the similarity of the score distributions noted earlier, 
a given score value represents a constant rank position across tests for the popula- 
tion identified in this study. The comparability of IQ scores is implied in common 
usage when a constant meaning is associated with mean 100 and standard deviation 
15 or 16. The fact that the initial IQ scales were derived using different 
standardization groups and different score conversion procedures, however, violates 
the conditions of comparability. The scales produced in this study, computed using 
a common sample and conversion procedures, provide for the correct use and inter- 
pretation of comparable IQ score scales in the study population and in populations 
with similar performance levels. 
Note


1. Financial support for this research was provided by the Educational Research Institute of British Columbia. 
Twenty-four undergraduate expository writing students wrote three affectively toned
essays, four traditional expository essays, and two research drafts. The intensity with 
which they experienced various emotions before, during, and after writing was meas- 
ured repeatedly, as was the frequency with which these emotions were experienced 
when writing in general. Intensity and frequency declined across episodes. Increased 
positive affect intensity was observed after most episodes. Little negative affect was 
noted, but when changes after writing occurred they reflected diminished intensity. 
Students writing on their own reported generally lower intensities and frequencies of 
emotion, regardless of valence. Satisfaction with written products was directly related 
to positive affect intensity but inversely related to negative affect intensity. Type of 
assignment influenced emotions, and writing skill was related to affect differentially 
when rated by teacher versus student. 


Language as a cognitive process has recently been the focus of considerable theo- 
retical investigation. Written discourse, in particular, is said to function best as a 
conscious and intentional act that can be planned, tracked, predicted, and con- 
trolled (Flower & Hayes, 1981; Scardamalia, Bereiter, & Goelman, 1981). 


Emotion is virtually ignored in this research. When emotion is examined, it is 
expected to disrupt the process: writer’s block (Rose, 1984), writing apprehension 
and academic correlates (Daly & Miller, 1975b), writing anxiety (Bloom, 1980; 
Holladay, 1981), writing apprehension and personality correlates (Daly & Wilson, 
1983), a scale measuring writing apprehension (Daly & Miller, 1975a), and reducing
writing apprehension (Fox, 1980; Powers, Cook, & Meyer, 1979; Smith, 1984). 

A random search of autobiographical material of professional writers reveals 
wide and interesting patterns of emotions. The literature suggests that negative 
emotions move established writers to composing or to expressing their emotions in
writing as easily as, if not more easily than, positive emotions. Authors seem to be
able to turn even depressive psychological states associated with diminished
activity into powerful motives for writing. Lashing-out types of emotions appear to
diminish or be replaced by feelings of satisfaction or relief during or toward the end
of writing. However, dissonant emotions seem to have as much potential for
motivating writing as entirely positive or negative ones. Finally, generalized states of
emotional arousal for which no value is necessarily attached also appear capable of 
mobilizing for writing and sustaining it (Brand, in press). 

On the basis of such anecdotal information, a first study (Brand & Powell, in 
press) of the emotions of student writers indicated that positive emotions increased 
and negative emotions decreased during writing. Skilled writers experienced more 
positive emotional change during writing than their unskilled counterparts and 
generally fewer negative feelings. All writers, regardless of skill, experienced more
negative feelings when writing on their own than when engaged in required school 
writing. 

The purpose of this study is to extend the work summarized above. It is de- 
signed to identify relationships among emotion states and traits and writing epi- 
sodes in a college level advanced expository writing class. Of especial interest are 
the emotions associated with several types of writing exercises, skill, and writing 
self-sponsorship. In addition, self-rated emotionality and satisfaction with the
written product are examined.


Emotion states are defined as qualitatively distinct conditions of the mind that 
have bodily and physiological properties. The subjective quality of the experience 
constitutes its central feature, and we know it as feelings or emotion. Emotion 
traits are affective dispositions of long standing and may be likened to personality 
characteristics. Emotion states are temporary departures from the emotional 
substrata or transitory reactions to particular phenomena (Zuckerman, 1977). 


Method 

Sample 

Twenty-four college composition students from the University of Missouri— 
St. Louis participated in this study. These students were enrolled in the second se- 
mester of a one-year, lower division required advanced expository writing course. 
Eighteen students were under 25 years of age. There were 10 females and 14 males. 
The number of years these students had been writing on their own ranged almost 
evenly from zero to 20, with the median at 13. Four considered themselves easy- 
going emotionally, 17 rated themselves moderately emotional, and three rated 
themselves as very emotional. At the outset of the study, six subjects rated them- 
selves more skilled in writing than their peers. At the end of the semester six 
subjects were rated as skilled writers by their instructor. 


Instrumentation 
The Brand Emotions Scale for Writers (BESW Trait-When-Writing (TWW) form
and BESW State (S) form) (Powell & Brand, in press) is a 20-item,
paper-and-pencil self-report designed to measure the emotions involved in writing. The
BESW was originally adapted from the Differential Emotions Scale (Izard, 
Dougherty, Bloxom, & Kotsch, 1974) which groups 30 emotion items into 10 a 
priori categories and has equivalent trait and state forms. 


The TWW form is concerned with how people feel in general when they write 
and how frequently the emotions are experienced during typical writing episodes. 
The S form is concerned with how people feel before, during, and after writing a 
particular piece and asks about the intensity of emotions at those times. The TWW 
and S forms include narrative questions. 


On the basis of a typology of the emotions (Davitz, 1969), three a priori factors 
on the BESW were identified: Positive, Negative Passive, and Negative Active. The 
Positive emotions consist of the items adventurous, affectionate, excited, happy, in- 
spired, interested, relieved, satisfied, and surprised. The Negative Passive emotions 
are ashamed, bored, confused, depressed, lonely, and shy. The Negative Active 
emotions are afraid, angry, anxious, disgusted, and frustrated. 


Several studies were conducted to assess the validity and reliability of the 
BESW scale, with validity coefficients ranging from .62 to .84 for the three factors 
on the emotion state observations and .64 to .80 for the three factors on the emo- 
tion trait observations. Reliability coefficients ranged from .71 to .85 (Norton, 
1985). The factor analysis yielded two factors for each observation, accounting for 
36% to 44% of the total variance on the three observations (i.e., TWW, S form
before writing, S form after writing; Powell & Brand, in press).


Procedures 
On four separate occasions during an eight-week summer session, students 
completed a demographic data sheet and three TWW forms. They also completed 
S forms immediately before (Pretest) and after (Posttest) nine required writing as- 
signments. During three of the writing episodes, students were interrupted and 
were administered a Midtest S form. Except for the writing episode analyzing the 
word “real,” all topics were student-selected. The writing schedule follows. 
Week 1: Pre/Posttest S form about a confusing experience
Week 2: Pre/Posttest S form with a descriptive essay
Week 3: Pre/Posttest S form with an analytic essay
        Pre/Posttest S form (with Midtest) about an adventurous experience
Week 6: Pre/Posttest S form with an early draft of a research paper
        Pre/Posttest S form (with Midtest) about a boring experience
Week 7: Pre/Posttest S form with a comparison/contrast essay
Week 8: Pre/Posttest S form (with Midtest) with an analysis of how we use the
        word “real”
        Pre/Posttest S form with the final version of research paper


Students were asked to be honest and accurate and told that their responses on 
the demographic data sheet or the BESW would have no bearing on their grades. 
The number of subjects per writing episode varied from 16 to 24. Ten subjects were 
present for all nine episodes. 
Results
Descriptive statistics, trends, and mean differences are reported first according to 
the sequence in which the observations were made. The alpha criterion in all tests 
of statistical significance was .05. Only trend tests were directional. 


Means and Trends Across Episodes 

Table 1 shows descriptive statistics for the emotion scales across time. Pre- and 
posttest Positive S means declined sharply early in the semester. The remaining 
slope was moderate and negative, but plummeted during the Boring Experience 
episode. By the end of the semester the Positive S pretest mean was 5.2 points be- 
low the corresponding pretest mean of the first writing episode (on a 9- to 45-point 
scale). The final Positive S posttest mean was 5.6 points below the first posttest 
mean. Spearman trend coefficients (rhos) were -.64 (p<.05) and -.84 (p<.005)
for the pretest and posttest means respectively. 
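
A trend coefficient of this kind is a Spearman rank correlation between episode order and the episode means. The sketch below shows the computation on a hypothetical series of nine pretest means; the values are invented for illustration and are not the Table 1 data, and the function names are assumptions.

```python
def average_ranks(values):
    """1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            result[order[k]] = shared
        i = j + 1
    return result

def spearman_rho(x, y):
    """Pearson correlation computed on the ranks of x and y."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(x)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / (var_x * var_y) ** 0.5

# Hypothetical pretest means for nine successive writing episodes.
episode = list(range(1, 10))
means = [21.0, 19.0, 18.3, 15.8, 16.8, 14.3, 17.0, 16.0, 16.1]
rho = spearman_rho(episode, means)
print(round(rho, 2))
```

A negative rho indicates that later episodes tend to have lower means; with only nine episodes, the coefficient must be fairly large in absolute value before a directional test reaches the .05 level.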

A similar decrease in Negative Passive pretest and posttest means was ob- 
served. The final pretest mean was 2.2 points lower than the first (on a 6- to 
30-point scale); the corresponding final posttest mean was 2.3 points lower. The 
trend coefficients for pretest and posttest means were -.83 (p<.005) and -.62
(p<.05). Again, slopes were temporarily interrupted during the Boring Experience 
episode. 

The Negative Active S data resembled those of the Negative Passive. The final 
pretest mean was 2.1 points lower than the first pretest mean, and the final 
posttest mean was 2.7 points lower than the posttest mean of the first episode (on a 
5- to 25-point scale). Pretest and posttest trend coefficients were -.75 (p<.025)
and -.27 (n.s.), respectively. All successive TWW means declined.


Mean Differences 

Mean differences are summarized in Table 2. Although directionally identical to 
the declining trends in pre- and posttest means, statistical significance was not 
reached for trends in mean differences (rhos were -.362, -.058, and -.112, for the
posttest-pretest Positive, Negative Passive, and Negative Active, respectively). 
Within episodes, however, seven of nine Positive S posttest means were signifi- 
cantly greater than pretest levels. Three of nine Negative Passive S posttest means 
were significantly less than pretest levels. But only one Negative Active posttest
mean was significantly less than its pretest counterpart. 


Among the three episodes in which midtests were available, two Positive means 
were significantly greater than pretest levels, and one Negative Passive S mean was 
significantly less than its pretest mean. A significantly greater posttest than 
midtest Negative Active mean was observed for one episode. 

The means for all three scales of the third TWW were significantly lower than 
their first TWW counterparts. Second TWW Positive and Negative Passive means 
were significantly lower than those of the first. The third TWW Positive mean was 
significantly lower than that of the second TWW. 


Writing Achievement or Skill 

Affect was also examined according to writing achievement as rated by both the in- 
structor and the students themselves. Instructor ratings were measured on a 2- 
point scale. Student self-ratings were measured on a 3-point scale. Seventy-five 
percent or 18 of 24 instructor and student ratings agreed. The frequencies with 
TABLE 1

Descriptive Statistics for the Emotion Scales

                                Positive                 Negative Passive         Negative Active
Date                N           M            SD          M            SD          M            SD

TWW Scales
  First             24          26.3         5.1         12.0         4.3         [illegible]  3.0
  Second            [missing]   22.8         6.1         10.0         [illegible] 10.6         2.8
  Third             19          20.5         5.5          9.8         [illegible] 10.2         3.0
VM Scales
  First             20          15.8         6.1          8.8         1.9          7.8         2.4
  Second            19          15.2         6.7          9.7         [illegible]  9.2         [illegible]
S Scales
  Confused Pre      23          [illegible]  5.5          9.9         2.9          9.7         3.9
  Confused Post     23          26.0         7.4          9.7         4.3          9.8         5.2
  Descript Pre      20          19.0         6.3          9.4         8.2          9.2         3.4
  Descript Post     20          24.6         6.7          7.6         2.0          6.9         2.0
  Analysis Pre      20          18.3         6.2          8.0         1.5          8.1         2.8
  Analysis Post     20          21.8         [illegible]  7.9         2.0          7.6         3.7
  Adventure Pre     18          15.8         6.5          8.7         2.8         [illegible]  1.9
  Adventure Mid     18          19.5         [illegible]  7.0         [illegible]  7.8         2.8
  Adventure Post    18          21.8         6.9          7.3         2.4          6.8         2.3
  Research D Pre    19          16.8         5.9          8.1         1.8          7.8         3.4
  Research D Post   19          20.8         8.1          7.6         2.0          7.9         4.3
  Boring Pre        16          14.3         4.4          8.4         1.9          6.8         1.6
  Boring Mid        16          14.1         4.4          8.3         2.0          7.5         2.6
  Boring Post       16          15.2         5.8          8.9         2.7          6.7         1.9
  CTC Pre           18          [illegible]  7.0          7.6         [illegible] [illegible]  1.9
  CTC Post          18          20.0         6.7          6.9         [illegible]  7.4         4.0
  Real Pre          18          16.0         4.8          7.4         1.8         [illegible]  2.1
  Real Mid          18          19.8         7.1         [illegible]  2.3          7.6         2.6
  Real Post         18          20.6         7.9         [illegible]  1.8          7.5         3.3
  Research F Pre    16          16.0         6.3         [illegible]  1.3          7.6         2.0
  Research F Post   16          20.4         9.1          7.4         1.8         [illegible]  2.9


which episode means were numerically greater for skilled versus unskilled writers 
were evaluated through cumulative binomial distributions. The probability of a sin- 
gle chance occurrence was set at .5. Separate analyses were computed for 
instructor-rated and student-rated skill. No significant differences were found. 
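The cumulative binomial evaluation described above can be sketched in a few lines. This is only an illustration under stated assumptions: the function names are hypothetical, and doubling the tail to obtain a two-tailed probability is an assumption, not something the article states. Under that assumption a 6-of-6 split gives about .03 and an 8-of-8 split about .008, which matches values reported later in the Results.

```python
from math import comb


def binomial_tail(k: int, n: int, p: float = 0.5) -> float:
    """Cumulative binomial probability P(X >= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


def two_tailed_sign_test(k: int, n: int) -> float:
    """Probability of a split at least as lopsided as k of n when each
    observation is equally likely to favor either group (p = .5).

    Doubling the one-sided tail is an assumption made for this sketch.
    """
    extreme = max(k, n - k)  # count in the more frequent direction
    return min(1.0, 2 * binomial_tail(extreme, n))


# Hypothetical usage: k observations out of n favored one group.
for k, n in [(6, 6), (8, 8), (9, 9), (8, 9)]:
    print(f"{k} of {n}: p = {two_tailed_sign_test(k, n):.4f}")
```

With only three observations (as for the TWW comparisons), even a 3-of-3 split yields a probability of .25 under this scheme, which is consistent with the remark that significance was not obtainable there.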

Instructor-rated skilled writers reported higher Positive affect on all three 
TWW observations, and higher Negative Passive and Negative Active mean fre- 
quencies than unskilled students on two of the three TWWs. Self-rated skilled 
writers reported identical mean frequencies for the Negative Passive factor among 
all TWWs but lower mean frequencies of Negative Active emotions on two TWWs. 
With only three observations, statistical significance was not obtainable. 


Students above the median (20) in years of writing on their own were consid- 
ered self-sponsored writers. They reported significantly more Negative Passive af- 
fect than other writers on only one observation (p=.011), and no instances in 
which Negative Active affect was more intense than that of the nonself-initiators.
By and large, self-sponsorship of writing and emotion traits were not associated. 
Emotions were contrasted for students above and below the median in
self-rated emotionality. The more emotional students reported significantly more
Negative Active feelings on 89% (p=.011) of the observations. Similarly, the more 
emotional students reported higher mean frequencies of Positive and Negative 
Active affect on all three TWWs and higher mean frequencies of Negative Passive 
affect on two of three TWW observations.

Students above the median in satisfaction with each of their written products 
reported significantly higher Positive feelings than less satisfied students (p=.004). 
Eight of nine Negative Active factor posttest means were significantly lower 
(p=.0392) for the more satisfied students.


TABLE 2

Mean Differences

                         Positive                  Negative Passive          Negative Active
                         M Diff       t            M Diff       t            M Diff       t
TWW Scales
  Third-First            [illegible]  [illegible]  [illegible]  [illegible]  [illegible]  [illegible]
  Second-First           [illegible]  [illegible]  [illegible]  [illegible]  [illegible]  2.03
  Third-Second           [illegible]  [illegible]  [illegible]  -0.00        [illegible]  -0.16
VM Scales
  Second-First           -0.6         -0.49        0.9          [illegible]  1.4          [illegible]
S Scales
Confused
  Posttest-Pretest       4.8          [illegible]  [illegible]  -0.25        0.1          0.09
Descriptive
  Posttest-Pretest       4.6          [illegible]  [illegible]  [illegible]  [illegible]  [illegible]
Analytical
  Posttest-Pretest       3.0          [illegible]  [illegible]  [illegible]  [illegible]  -0.69
Adventure
  Posttest-Pretest       6.0          [illegible]  [illegible]  [illegible]  -0.7         [illegible]
  Midtest-Pretest        3.7          [illegible]  [illegible]  [illegible]  0.3          0.45
  Posttest-Midtest       [illegible]  [illegible]  [illegible]  -0.88        [illegible]  [illegible]
Research Draft
  Posttest-Pretest       4.0          2.65*        [illegible]  [illegible]  0.1          0.10
Boring
  Posttest-Pretest       0.9          0.64         0.5          0.69         [illegible]  [illegible]
  Midtest-Pretest        -0.2         -0.25        [illegible]  [illegible]  0.7          1.07
  Posttest-Midtest       [illegible]  1.41         0.6          0.88         -0.8         [illegible]
Compare & Contrast
  Posttest-Pretest       2.9          [illegible]  [illegible]  [illegible]  0.1          0.14
Real
  Posttest-Pretest       4.6          [illegible]  [illegible]  -0.84        0.4          0.88
  Midtest-Pretest        3.8          [illegible]  0.0          0.00         0.5          1.24
  Posttest-Midtest       0.8          0.69         [illegible]  [illegible]  [illegible]  [illegible]
Research Final
  Posttest-Pretest       4.4          [illegible]  [illegible]  [illegible]  [illegible]  -0.98

*p<.05
**p<.01
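As a reading aid for the M Diff and t columns, here is a minimal sketch of how a mean difference and its t value could be computed from paired pretest and posttest scores. It assumes a dependent-samples (paired) t test, which seems likely for a pretest-posttest design but is an assumption on our part. The function name and the scores are hypothetical and are not data from the study.

```python
from math import sqrt
from statistics import mean, stdev


def paired_t(pre, post):
    """Return (mean difference, t, degrees of freedom) for paired scores.

    Assumes a dependent-samples t test: t = mean(d) / (sd(d) / sqrt(n)).
    """
    if len(pre) != len(post) or len(pre) < 2:
        raise ValueError("need two equal-length samples with n >= 2")
    diffs = [b - a for a, b in zip(pre, post)]  # posttest minus pretest
    n = len(diffs)
    d_bar = mean(diffs)
    s_d = stdev(diffs)  # sample SD of the differences (n - 1 denominator)
    if s_d == 0:
        raise ValueError("all differences are identical; t is undefined")
    return d_bar, d_bar / (s_d / sqrt(n)), n - 1


# Hypothetical Positive-scale scores for six students (illustration only).
pre_scores = [14, 18, 20, 15, 22, 17]
post_scores = [19, 21, 24, 18, 25, 23]
m_diff, t_value, df = paired_t(pre_scores, post_scores)
print(f"M Diff = {m_diff:.1f}, t({df}) = {t_value:.2f}")
```

A positive M Diff corresponds to a posttest mean above the pretest mean, matching the row labels (for example, Posttest-Pretest).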
Types of Assignment

Pretest to posttest mean differences were ranked and examined by the following 
breakdown of assignments: those with an affective tone, that is, Confused, Adven- 
turous, and Boring experiences; traditional expository writing topics, that is, 
Descriptive, Analytic, Compare/Contrast, and Examination of the word “real”; and 
Research Draft and Final Research Paper.²


Assignments with Specific Affective Tone 

Among the nine assignments, the Adventurous and the Confused experiences 
ranked first and second highest in increase of Positive affect. Negative Passive 
emotions decreased numerically for the same two episodes, but decreased signifi- 
cantly only for the Adventurous assignment. The Adventurous experience ranked 
second highest in both decreased Negative Passive and Negative Active emotion. 
The Boring experience episode ranked last and was the first episode for which 
there was no statistically significant increase in Positive affect. It was also the only 
episode for which there was a numerical increase in Negative Passive affect. 


On the three assignments involving affective tone, instructor-rated skilled 
writers reported greater Positive emotion as frequently as the unskilled. Self-rated 
skilled writers reported more intense Positive feelings on 83% of the observations. 
Instructor-rated skilled writers reported stronger Negative Passive and Negative 
Active means 33% of the time. Self-rated skilled writers reported stronger 
Negative Active emotion 33% of the time, but greater Negative Passive affect only 
17% of the time. 

Self-sponsored and nonself-sponsored writers reported stronger Positive emo- 
tions with equal frequency. But in none of the observations was the Negative affect 
of self-sponsored writers higher than those of students not writing on their own 
(p=.03). 

More emotional students reported stronger Positive emotions than the less 
emotional students on 33% of the observations, more Negative Passive feelings on 
67%, and Negative Active affect on 83% of the six observations. Students above 
the median in satisfaction with their affectively toned compositions reported con- 
sistently higher Positive factor posttest means (p=.03) and consistently lower 
Negative Passive and Active factor posttest means (p=.03). 


Traditional Expository Assignments 

The traditional expository assignments asked students to (a) Describe, (b) Analyze, 
(c) Compare/Contrast, and (d) Explain the use of the word “real.” The Descriptive 
assignment stands apart from the others in this group in one important way. It was 
the only episode in which changes in all three factors were statistically significant. 
The Descriptive assignment ranked highest in decreased Negative Passive and 
Negative Active affect, and it tied with the “real” assignment as third highest in in- 
creased Positive affect. A significant decrease in Negative Passive affect was found 
in the Descriptive and Compare/Contrast episodes. 

Instructor-rated skilled writers reported greater Positive intensity across these 
observations than their self-rated counterparts. Instructor-rated skilled writers re- 
ported greater Negative Passive affect 50% of the time, while self-rated skilled 
writers reported stronger Negative Passive feelings only 12% of the time. Similarly, 
instructor-rated skilled writers reported greater Negative Active affect 38% of the 
time, whereas self-rated skilled writers reported greater Negative Active affect 
twice (25%). While instructor-rated skilled writers reported stronger Negative
emotions on the Descriptive episode, their unskilled counterparts reported stronger 
Negative emotions on the Analytic and real episodes. 

Self-sponsored writers reported stronger Positive affect on half of the 
traditional group observations, stronger Negative Passive affect only once, and 
stronger Negative Active affect not at all (p=.008). The more emotional students 
reported stronger Positive and Negative Passive affect on 62% of the traditional 
group observations and greater Negative Active affect on 88% of them. As was the 
case for the emotionally toned episodes, students more satisfied with their 
traditional compositions reported consistently higher Positive factor posttest 
means and lower Negative Passive and Negative Active posttest means (p=.008). 


Research Assignments 

In magnitude of increased Positive affect, both Research episodes ranked in the 
middle of the distribution. The Final Research paper ranked higher than the Draft 
in decreased Negative Passive and Negative Active affect. In terms of skill, there 
was no difference in Positive and Negative Active affect between instructor- and 
self-rated skilled writers. However, the instructor-rated skilled group reported 
stronger Negative Passive feelings on 50% of these observations, while the self- 
rated skilled group reported this for 75% of them. On the final research paper, 
instructor-rated unskilled writers reported stronger Positive and Negative Passive 
feelings. 


Self-sponsored writers reported consistently more Positive and less Negative 
feelings on both Research exercises. No difference in frequency of Positive affect 
was found when emotionality was examined. But the more emotional students re- 
ported more Negative Passive and Negative Active affect on all Research observa- 
tions. The relationships between writer satisfaction, the research products, and 
mean affect intensity differed from earlier findings. The more satisfied writers of 
Research Drafts reported stronger posttest Positive and Negative Passive emotions 
but weaker posttest Negative Active emotions. The more satisfied writers of the 
Final Research paper reported stronger emotions on all three posttest factors. 


Discussion 

Despite the small sample size and great number of variables, several findings stand 
out. Emotional intensity clearly decreased as the semester progressed. Slopes were 
especially sharp early in the semester, then seemed to level off. Echoing the
stabilizing effect of emotions across episodes, students perceived themselves as 
feeling emotions less frequently when writing in general. Significant decreases in 
trait frequencies were present in six of nine comparisons, and all mean differences 
were negative. Emotion traits are theoretically stable. The changes in trait factors, 
however, suggest otherwise. One hypothesis is that emotion traits in writing change 
as writing experience increases. Another explanation is that students may have 
overestimated the degree to which their state conditions could be generalized to 
other writing contexts. 

But the most plausible explanation for state and trait changes involves stu- 
dents’ attitudes toward the class. That school apathy is endemic is conventional 
wisdom and has corroboration in the literature (Brand & Powell, in press; 
Csikszentmihalyi, 1975; Csikszentmihalyi & Larson, 1984; Larson, 1984). This 
study was conducted during a summer session in which 16 weeks of regular 
semester work was condensed into eight, with some subjects filling out as many as
26 emotion scales during this period. The quantity of writing and the repetitive 
measurement process could induce still more fatigue and tedium. The entire group 
consisted of six skilled writers and 18 unskilled writers. We would expect unskilled 
writers to feel negative about a required writing class. Even strong writers can 
suffer academic burnout, though perhaps for different reasons. What we found 
seemed to be a related phenomenon: a persistent erosion of initial interest. Stu- 
dents were not increasingly bored; they simply felt progressively less. 


Accompanying these decreases were changes within episodes. Part of these 
changes can be attributed to the varying lengths of the factor scales. But most 
changes indicate that the students were, in fact, reacting differentially to the emo- 
tion items. Positive changes were uniformly direct and relatively strong. With one 
exception, negative passive changes were inverse and moderate. Negative active 
changes were mixed and weak, replicating the findings of Brand and Powell (1986). 
Given the intensity of the instruction and the number of scale administrations, 
however, we would expect students to experience not less but more negative active 
emotion. This was not the case. Negative active emotions were more resistant to
change than the negative passive. 

Despite the flattening effect of affect across episodes, there were more emo- 
tional fluctuations when emotions were written into the prompt than when it was 
traditional exposition. The most statistically dynamic were the emotionally toned 
and descriptive exercises, all of which encouraged students to write about personal 
matters. Despite its middle position in the assignment sequence, writing about an 
adventurous experience produced the greatest positive increase. Even the confused 
assignment produced positive feelings. 

Except for the boring episode, the research episodes prompted the most nega- 
tive reactions. The final research paper ranked fifth in numerical increase in posi- 
tive affect. Yet three of the four episodes before it were accompanied by 
statistically significant increases in positive feelings which were numerically lower. 
For these episodes observations were available on only 16 students, the lowest level 
of participation in the study. Had the mean increase held constant with several 
more subjects, the increase in positive affect on the final research episode would 
have reached significance. In contrast, even if all 24 students had been present dur- 
ing the boring episode, with the same mean increase, significance would not have 
been reached. Clearly, magnitude of difference is not the only factor in determining 
statistical significance. Sequence of assignment is important in student response. 
So too is type of assignment. 
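The point about sample size can be checked with simple arithmetic. The sketch below assumes a paired t test, t = M_diff / (SD_diff / sqrt(n)), and assumes that the standard deviation of the differences stays the same as n grows. It takes the Boring episode's Positive values from Table 2 (mean increase 0.9, t = 0.64) and N = 16 from Table 1, backs out the implied SD, and recomputes t for 24 students. The function names are hypothetical.

```python
from math import sqrt


def implied_sd_of_differences(mean_diff: float, t: float, n: int) -> float:
    """Back out SD_diff from a reported paired t: SD = M_diff * sqrt(n) / t."""
    return mean_diff * sqrt(n) / t


def t_at_sample_size(mean_diff: float, sd_diff: float, n: int) -> float:
    """Paired t for the same mean difference and SD at a different n."""
    return mean_diff / (sd_diff / sqrt(n))


# Two-tailed .05 critical value of t for df = 23 (standard t table).
CRITICAL_T_DF23 = 2.069

# Boring episode, Positive scale: values read from Tables 1 and 2.
sd_diff = implied_sd_of_differences(mean_diff=0.9, t=0.64, n=16)
t_if_24 = t_at_sample_size(mean_diff=0.9, sd_diff=sd_diff, n=24)

print(f"implied SD of differences = {sd_diff:.3f}")
print(f"t with 24 students = {t_if_24:.2f}; significant: {t_if_24 > CRITICAL_T_DF23}")
```

Under these assumptions t rises only from 0.64 to about 0.78, well short of the critical value, which agrees with the statement that the Boring episode would not have reached significance even with all 24 students present.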

The pattern of emotions associated with the boring episode appeared dramati- 
cally different from that of all other tasks, emotionally toned or otherwise. Stu- 
dents were not only asked to write about a boring experience, but the term was 
directly represented on the negative passive scale. It is not surprising that the bor- 
ing experience ranked last in increased positive feelings and was the only episode 
that resulted in more negative passive feelings. Indeed, writing about something 
boring induces boredom. Emig (1971) reported that even competent student writers 
working on topics unrelated to boredom found little satisfaction when contemplat- 
ing their finished work. Like starting, finishing writing is viewed as a “mundane 
moment devoid of any emotion but indifference and the mildest of satisfactions 
that a task is over” (p. 87). Emig further explained that the lack of affect she ob- 
served might reflect an impoverished emotional vocabulary. If this were the case, 
providing students with that language would allow them to disclose their feelings. 
In this study, the BESW gave them positive and negative emotion items alike, and
students chose the positive. However, when they saw their writing selves as emo- 
tional, they reported more intense negative feelings. The dominant effect in this 
study was positive. But it is also possible that, regardless of their true feelings, stu- 
dents found it socially unacceptable to disclose negative affect, reminding us once 
again that emotion still has a bad name. 


With one temporary but profound exception, the classroom experiences of these 
students were apparently anesthetizing and pleasant. Although the population was
weighted heavily with unskilled writers, their emotions stayed remarkably positive. 
However, the constellation of emotions did shift. The five top-ranking emotion 
items on the pretests were all positive: happy, interested, adventurous, excited, and 
inspired, in that order. Posttest findings still showed only positive emotions as the 
top five. But the items, adventurous and inspired, were replaced by relief and satis- 
faction, both of which tied for highest intensity, confirming a finding reported in 
the earlier study (Brand & Powell, in press). 


Positive feelings at the end of writing seem to refer to a complex of pleasant 
emotions of which relief and satisfaction are a part. But relief does not necessarily 
refer to release from a negative situation. It may be linked to a sense of closure that 
a task is done satisfactorily. Significant relationships were observed between satis- 
faction with the product and six posttest states. Writer satisfaction was directly re- 
lated to pleasant feelings after writing, regardless of the assignment and, except for 
the research episodes, inversely related to unpleasant feelings. This finding indi- 
cates that good writing is linked to good feelings and weak writing is linked to bad. 
We also hypothesize that satisfaction may not be so much an effect as a cause. 
Dissatisfaction with a product may induce negative feelings, and satisfaction with a 
product may induce pleasant feelings. To be sure, pleasant feelings are not 
synonymous with quality writing; in this study there was a high proportion of un- 
skilled writers, who, by definition, do not do satisfactory work. However, they are 
more easily satisfied with it (Flower & Hayes, 1980). They would also be expected 
to exercise poor judgment about what constitutes their best writing. Perhaps they 
reason that, because they work hardest at it in composition classes, their writing 
ought to be good regardless of its actual merit. 


The measure of writing achievement is reflected in the skill ratings. Supporting 
the findings of Brand and Powell (1986), unskilled writers, regardless of rater, re- 
ported more intense negative affects—which makes sense. On the other hand, the 
longer students had been writing on their own the more positive they tended to feel 
after they wrote and almost always before. These students were also less hostile to 
the formal school activity, a finding that agrees with Brand and Powell. 


Self-sponsored and skilled writers alike reported generally lower negative affect 
on all but the research assignments. Students accustomed to writing on their own 
felt better about it as did skilled writers. This has its virtues and drawbacks. 
Having learned to function in an environment that prizes passivity, these students 
approach their writing in a businesslike, matter-of-fact manner. However, because 
the school context so dominates their task environment, they are not likely to expe- 
rience the emotions professional writers find useful in the writing process (Brand, 
in press). 

Subdividing skilled writers by rater, self-rated skilled writers reported substan- 
tially more positive feelings on the affectively toned episodes than the instructor- 
rated skilled writers but weaker positive feelings on the traditional assignments. 
Relationships Between Types of Assignments


Our sense is that writing teachers generally value traditional exposition and com- 
municate it. It stands to reason that they would rate as skilled students who per- 
form well on these types of assignments. These students would, in turn, feel good— 
a phenomenon which we call the goody-two-shoes effect. All in all, we found a more 
discriminating picture of affect when skill was defined by the students themselves. 
So it seems that self-perception of skill is a more sensitive predictor of emotional 
response to academic writing than instructor-rated skill, again confirming the earli- 
er study (Brand & Powell, in press). Examining the relationship between writing 
achievement and emotion can be improved by increasing the number of intervals 
on which writing performance can be measured. 


Considerable research is being conducted to describe writers’ cognitive pro- 
cesses. Affective outcomes are a long-standing educational objective, but they have 
seldom received systematic attention and certainly not in writing theory and re- 
search. Furthermore, there is virtually no body of knowledge on the interaction of 
cognitive and affective processes in a full range of academic skills, much less in 
writing. 

The present study is a second effort to describe relationships among emotion 
variables potentially important in the highly cognitive process of writing. These 
findings, together with those obtained on the intellectual patterns of writers, 
suggest that complex interactions occur during the writing process, which are at 
once cognitive and affective, physical and psychological, and enabling and disa- 
bling. Assignment topic and type influence these outcomes in much the same way. 
Emotions, cognitions, and achievement are sensitive to teachers’ views of their stu- 
dents as writers and students’ views of themselves. Emotions are guided not only 
by the nature of the task but also by the very participation in it. If little else, we 
surely need to reconsider the impact of academic writing situations on student re- 
sponse. 


Notes 


1. A complete description of the scale design and development as well as the factor analysis, cross-validation studies, and tests
of reliability may be obtained from Dr. A.G. Brand, Department of English, 8001 Natural Bridge Road, University of 
Missouri—St. Louis, St. Louis, Missouri 63121-4499. 

2. Although the research paper is clearly a traditional expository writing assignment, because it was a longer, more
complicated undertaking, it was addressed separately.
Mainstreaming Exceptional Children:
Teacher Attitudes and the 
Educational Climate¹


The attitudes of classroom teachers toward exceptional children and the notion of 
mainstreaming are crucial, at least as important as teacher competencies, curricula 
modifications, or administrative strategies. Many variables affect teacher attitude 
formation and maintenance, but the relative impact of each remains imprecise. This 
study used a 25-item Attitude Survey to assess teacher attitudes toward current phi- 
losophies and concepts regarding mainstreaming. Results indicated that in Australia, 
where integration is a rather new and not wholly accepted educational policy, teacher
attitudes were significantly less favorable than were those of two Canadian teacher 
samples. 


The past 30 years have witnessed an increasing interest and involvement in special 
education in countries in all stages of development. Generally, the creation of soci- 
eties in which handicapped persons can enjoy productive and fulfilling life styles is 
now seen as an imperative, rather than a dream (Ward, 1980). 

In Canada the integration of exceptional children into the regular educational 
milieu is now policy. Currently, six provinces have some form of mandatory legisla- 
tion regarding exceptional youngsters. The other provinces and the two territories 
have permissive legislation, implying that they may provide services but are under 
no legal obligation to do so (Goguen, 1980). 

However, across the country, the general issues of children’s rights, individual 
education plans, and least restrictive environments are being resolved (McMurray, 
1980). Mainstreaming is the most prevalent pattern of instructional organization 
for special education programming. Nearly 77% of Canadian exceptional children 
are placed in integrated classroom settings (Council of Ministers of Education, 
1983). 

Approximately 15.5% of the Canadian school age population will require special 
education services at some point during their school careers (Council of Ministers 
of Education, 1983). Similarly, Australian educators estimate that 15% to 16% of 
children need special education services (Ward, 1980). 


Dr. Winzer is with the Department of Educational Psychology where she teaches courses in special education. 
In recent years in Australia an increasing interest in, and commitment to,
special education has been fueled by a number of federal and state documents (Cohen
Committee, 1973; Interim Committee for the Australian Schools Commission, 1973; 
Senate Standing Committee on Education, Science and the Arts, 1972; Senate 
Standing Committee on Health and Welfare, 1971; Victoria, Department of Educa- 
tion, 1984) along with nongovernment reports (Rehabilitation International, 1979; 
Rigby, 1973). The issue of integration or segregation has become one of the major 
contemporary issues of Australian special education. 


It is now Australian policy to provide a free and appropriate education for ex- 
ceptional children in the regular schools. However, while integration is an 
Australian-wide policy, it remains a matter of philosophy rather than practice 
(Morris, 1982). Although legislation has been discussed and promoted (Steinberg, 
1980) none is yet in place. Indeed, Andrews, Elkins, Berry, and Burge (1979) con- 
cluded that, as far as special education was concerned, the Australian legal system 
was in considerable disarray. 


As the explicit expression of the principles and ideas of normalization and 
mainstreaming becomes increasingly manifest in practice, exceptional pupils are no
longer solely the responsibility of special educators, maintained in self-contained 
classrooms, special schools, or institutions. Rather, many exceptional youngsters 
are now within the orbit of the public schools, placed in regular classrooms, and the 
responsibility of regular classroom teachers. Hence much of the success of integra- 
tion depends upon the preparation and training of regular classroom teachers, the 
availability of support services and, of prime importance, the teachers’ attitude 
toward the children and their acceptance of responsibility for their education 
(Drummond, 1978). 


It is generally accepted that teacher attitudes can positively or adversely affect 
student achievement, student behavior, and teacher behavior. The attitudes and 
subsequent expectations held by classroom teachers are crucial, at least as impor- 
tant as teacher competencies, curricular modifications, or administrative strategies. 
But a body of literature has demonstrated that regular classroom teachers are not 
overwhelmingly supportive of the notion of mainstreaming. They are ill-prepared 
for the enterprise, both in attitude and knowledge. They harbor generally unfavor- 
able attitudes toward the concept and process, and argue for the retention of spe- 
cial classes (Barngrover, 1971; Bond & Dietrich, 1982). 


Given the crucial nature of teacher attitudes, investigators have probed various 
aspects of attitude formation and maintenance. This research has generally been 
contained within three major categories. First, studies have been directed toward 
acceptance-rejection issues in regard to specific groups of exceptional pupils 
(Curtis, 1985; Guerin, 1979; Mitchell, 1981; Moore & Fine, 1978; Shotel, Iano & 
McGettigan, 1972; Smith, 1979; Vacc & Kirst, 1977; Vandivier & Vandivier, 1981; 
Ward, Parmenter, Riches, & Houritz, 1978). A consistent finding of such investiga- 
tions is that regular classroom teachers prefer special class placement for children 
classified as mentally retarded and emotionally disturbed. If mainstreaming is in- 
evitable, then learning disabled students are preferred. 


A second body of literature has investigated the relationship of teacher
variables and attitudes toward mainstreaming (De Leo, 1976; Donaldson & Martinson,
1977; Guerin & Szatlocky, 1974; Harisymaw & Horne, 1975; Higgs, 1975; Larrivee & 
Cook, 1979; Smith, 1979). Areas of interest have included teacher sex, age, years of 
experience, grade level taught, geographical location, and position in the
educational hierarchy. The results of such investigations are generally inconclusive.

A third group of studies has investigated whether knowledge about, and
exposure to, exceptional individuals can positively change attitudes (Bond & Dietrich,
1982; Brooks & Bransford, 1971; Cronk, 1978; Donaldson & Martinson, 1977; Glass 
& Meckler, 1972; Harisymaw & Horne, 1976; Johnson & Cartwright, 1979; Larrivee, 
1981; Leyser, Abrams & Lipscomb, 1982; Naor & Milgram, 1980; Reed & Hill, 1981; 
Sanche, Haines, & Van Hesteren, 1982; Winzer, 1984b). These studies indicate that 
information and exposure are potent factors in establishing and maintaining posi- 
tive feelings toward exceptional individuals and the concept of mainstreaming. 


Fewer studies have investigated the attitudes of prospective teachers. However, 
research has demonstrated that students hold generally positive attitudes (Curtis, 
1985; Drake, 1977; Kang, 1976; Sanche, Haines, & Van Hesteren, 1982; Winzer, 
1984a). Special education students tend to be more accepting than regular educa- 
tion trainees (Curtis, 1985; Kiernan, 1974). 


Even fewer studies exist regarding whether agreement with the philosophy of 
mainstreaming, as implied by current laws and policies, may be a determiner of 
positive teacher attitudes. Ringlaben and Price (1981) found that teachers who 
tended to report strong agreement with the philosophy of mainstreaming also 
tended to report that mainstreaming was working well in their classrooms and that 
it had positive effects on integrated students, on teaching performance, and on atti- 
tudes toward teaching. Similarly, Bond and Dietrich (1982) found that agreement 
with the philosophy of mainstreaming became manifest in positive attitudes. Thus 
it appears that teacher agreement with the intent of mainstreaming may be a de- 
terminer in the success experienced by the exceptional child in the classroom. In 
addition, an educational climate that promotes mainstreaming may also promote a 
positive individual philosophy. 


Purpose of the Study 

This study was designed to examine whether the educational climate, viewed as the 
concepts and policies in popular circulation, constituted a variable in determining 
the attitudes of teachers toward exceptional children and the notion of main- 
streaming. Three geographical areas were selected as sites for the investigation. 
Two were in Canada: Ontario, which has mandatory legislation concerning
exceptional children, and British Columbia, which has not passed mandatory
legislation but where both policy and practice promote the integration of
exceptional children into regular classrooms. In Western Australia, the third
location, integration is viewed as acceptable policy, but no legislation is in place.


Method 

Instrument 
In order to assess teacher attitudes, a 25-question Likert-type scale was developed. 
The Attitude Scale comprises two dimensions: A, the possible effects of main- 
streaming on children, both exceptional and normal, and B, the cost of main- 
streaming to the classroom teacher and the school. 

The development and use of the Attitude Survey has been reported previously 
(Winzer, 1984a, 1984b). Reliabilities on the present study, as determined by the 
LERTAP statistical procedure (Nelson, 1974) were adequate. Dimension A 
produced a Hoyt estimate of reliability of .86 and B gave .79; Cronbach’s alpha
reliability coefficient for the total scale was .85.
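For readers unfamiliar with the reliability coefficient reported above, the following sketch shows the usual formula for Cronbach's alpha, alpha = k/(k-1) * (1 - sum of item variances / variance of total scores). It is not the LERTAP procedure used in the study. The function name and the response matrix are hypothetical.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    if len(scores) < 2 or len(scores[0]) < 2:
        raise ValueError("need at least two respondents and two items")
    k = len(scores[0])
    items = list(zip(*scores))  # transpose: one tuple of scores per item
    item_variance_sum = sum(variance(item) for item in items)
    total_variance = variance([sum(row) for row in scores])
    if total_variance == 0:
        raise ValueError("total scores do not vary; alpha is undefined")
    return (k / (k - 1)) * (1 - item_variance_sum / total_variance)


# Hypothetical responses: five teachers answering three 5-point items.
responses = [
    [4, 5, 4],
    [2, 3, 3],
    [5, 5, 4],
    [1, 2, 2],
    [3, 4, 4],
]
print(f"alpha = {cronbach_alpha(responses):.2f}")
```

Alpha approaches 1 as the items rank respondents more consistently. The toy data above are deliberately consistent, so the value they give is much higher than would be expected from a real 25-item survey.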


The Attitude Survey was administered during the opening session of introduc- 
tory in-service special education courses. Scale items were both positively and neg- 
atively worded. Participants were asked to mark one of five alternatives, ranging 
from strongly agree to strongly disagree. As well, information was requested regard- 
ing relatively static teacher variables such as sex, age, status in the educational 
hierarchy, years of teaching experience, grade level taught, and qualifications to 
teach exceptional children. 
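Because the scale mixes positively and negatively worded items, the responses have to be put on a common direction before they are summed. The article does not describe its scoring rule, so the sketch below is an assumption: a 5-point scale coded 1 to 5, with negatively worded items reverse-scored as 6 minus the raw response. The function names and the item positions are hypothetical.

```python
def score_response(raw: int, negatively_worded: bool, points: int = 5) -> int:
    """Score one Likert response so that a higher score is always more favorable.

    Assumes responses are coded 1..points; negative items are reverse-scored.
    """
    if not 1 <= raw <= points:
        raise ValueError(f"response must be between 1 and {points}")
    return points + 1 - raw if negatively_worded else raw


def scale_score(responses, negative_items):
    """Sum the scored responses; negative_items holds 0-based item positions."""
    return sum(
        score_response(raw, index in negative_items)
        for index, raw in enumerate(responses)
    )


# Hypothetical four-item excerpt; items at positions 1 and 3 are negatively worded.
print(scale_score([5, 1, 3, 2], negative_items={1, 3}))
```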


Sample 
Samples were drawn from trained teachers taking additional in-service training in 
three locations: British Columbia (n=142), Ontario (n=109), and Western 
Australia (n=73). Of the participants, approximately one-third were 25 years of 
age or less and the rest were 26 or older. In each group, women outnumbered men 
by approximately two to one. 

Drawing samples from in-service special education courses immediately places 
limitations upon the study. The respondents were not representative of the public 
at large. Moreover, they may not accurately reflect the attitudes of classroom 
teachers as enrollment in a special education course may indicate a positive predis- 
position toward exceptional individuals and the notion of mainstreaming. 


Results 
The research was undertaken in order to assess whether teachers, functioning 
under different educational policies and legislation in widely varied geographical 
areas, held meaningfully different attitudes toward exceptional children and the 
idea of mainstreaming. Mean attitude scores and standard deviations were ob- 
tained, as shown in Table 1. 


Inspection of the mean scores showed that both groups of Canadian
teachers hold attitudes that lean in a positive direction. Australian teachers are less
positive, especially in regard to the costs to the classroom teacher. A probable
explanation of these differences is offered by Morris (1982), who observed that while
Australian teachers are aware of mainstreaming advocacy, the idea remains a matter
of philosophy rather than practice. Moreover, Watts and his colleagues (1978)
found Australian teachers to be ill-prepared for mainstreaming and unaware of
personnel such as resource room teachers.


Using a one-way analysis of variance (ANOVA), contrasts were computed to
determine the existence of group differences. The data were examined in regard to
teacher-related characteristics: location, sex, age, years of teaching experience,
grade level taught, and qualifications to teach exceptional children. Table 2
presents the summary statistics.

The main effect in the ANOVA was for location. A significant difference on both
dimensions was found between the Australian and the Canadian teachers, in favor of
the latter. Post hoc Scheffé procedures confirmed that the Australian
group demonstrated meaningfully different attitudes, while no statistically significant
differences were found between the samples from Ontario and British Columbia in
regard to location alone.
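
As a reading aid for Table 2, the F ratio in a one-way ANOVA is the between-groups mean square (SS divided by df) divided by the within-groups (error) mean square. Table 2 does not report the error term, so the sketch below backs out the error mean square implied by the reported Location values. The function names are illustrative, and the results are only as accurate as the rounded figures in the table.

```python
def mean_square(ss, df):
    """Mean square = sum of squares divided by its degrees of freedom."""
    return ss / df


def implied_error_ms(ss_between, df_between, f_ratio):
    """Error mean square implied by F = MS_between / MS_error."""
    return mean_square(ss_between, df_between) / f_ratio


# Location effect as reported in Table 2: (SS, df, F) for each dimension.
location = {
    "Dimension A": (140.111, 2, 4.978),
    "Dimension B": (973.588, 2, 26.473),
    "Full Scale": (1807.750, 2, 22.574),
}
for name, (ss, df, f) in location.items():
    print(name, round(implied_error_ms(ss, df, f), 2))
```

The implied error mean squares (roughly 14, 18, and 40) fall somewhat below the squared all-sample standard deviations in Table 1 (about 14.4, 21.3, and 45.5), as would be expected if location accounts for part of the total variance.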

TABLE 1
Responses to the Attitude Survey

                    Dimension A                      Dimension B                      Full Scale
                    N     Mean     s.d.    Total     N     Mean     s.d.    Total     N     Mean     s.d.    Total
                                           Possible                         Possible                         Possible
British Columbia    138   44.870   3.692   60        136   48.750   4.279   65        134   93.612   6.321   125
Ontario             104   44.971   3.290   60        99    48.010   4.523   65        97    92.866   5.924   125
Western Australia   87    43.487   4.321   60        86    44.570   4.016   65        84    87.929   6.775   125
All Samples         329   44.523   3.798   60        321   47.402   4.617   65        315   91.807   6.749   125


TABLE 2
Summary Statistics of Teacher Related Variables

                     Dimension A                 Dimension B                 Full Scale
                     SS         df   F           SS         df   F           SS         df   F
Location             140.111    2    4.978*      973.588    2    26.473*     1807.750   2    22.574*
Sex                  11.445     1    0.791       8.006      1    0.377       0.204      1    0.947
Age                  5.530      1    0.383       206.006    1    9.875*      306.814    1    6.845*
Status               52.776     4    0.913       233.311    4    2.794**     462.530    4    2.582**
Experience           11.675     2    .159        177.286    2    2.799
Grade Level Taught   72.286     5    .369        115.340         .645
Qualifications       794.874    4    6.024*      593.311    4    5.142*      2602.324   4    7.070*

*p<.01
**p<.05


Other teacher-related variables (sex, age, status, experience, grade level taught,
and qualifications) were examined for the total sample. It was found that older
teachers demonstrated more favorable attitudes, although experience showed no
statistically significant effect. In all three groups, school administrators showed the
most positive attitudes, confirming the findings of earlier studies (De Leo, 1976;
Payne & Murray, 1974). No differences were discovered in relation to the sex of the
respondents.

When each sample was examined for intragroup differences, the Australian 
teachers demonstrated homogeneity in their attitudes. Similarly, the Ontario sam- 
ple produced an intragroup difference only on the qualifications variable. It was 
found that the two most highly qualified Ontario subgroups held the most negative 
attitudes. British Columbia teachers produced no intragroup differences except in 
relation to grade level taught. There was a tendency for positive attitudes to de- 
crease as grade level increased. 

To further examine differences, a one-way analysis of variance, confirmed by
Scheffé post hoc procedures, was computed on single items, as shown in Table 3.
Location was the variable used for this interpretation. With the exception of
item 4, the Australian teachers showed less favorable attitudes on all items
demonstrating statistical significance. Ontario teachers were least positive about
item 4.

TABLE 3

                                                  Location                    Grade Level Taught
Item                                              SS            F             SS            F

Dimension A
 1. Mainstreaming the exceptional child will      31.023        15.130*       1.317         .316
    promote his/her independence.
 4. Exceptional children will find it much        4.506         3.426**       2.396         [illegible]
    easier to mix with their peers after
    leaving school if they have been taught
    together in regular classrooms.
 9. The integration of exceptional children       13.323        [illegible]   1.477         .006
    into regular classes is beneficial to
    regular pupils.
12. Mainstreaming offers mixed group              8.259         [illegible]   2.119         1.089
    interaction which fosters understanding
    and acceptance of differences.
14. Mainstreaming is likely to have a negative    [illegible]   6.105*        3.908         .849
    effect on the emotional development of
    the exceptional child.
16. Mainstreaming will give exceptional           10.602        10.108*       3.139         1.88
    students a better chance to readily fit
    into their home community.
17. The exceptional child is likely to be         8.735         [illegible]   2.219         .303
    socially isolated by regular students.
24. Regular students quickly become               7.669         5.013*        2.838         .924
    accustomed to having exceptional pupils
    in the school and naturally accept them
    as peers.

Dimension B
 8. The image of a particular school benefits     15.804        6.665*        5.252         .852
    from the presence of exceptional children.
13. As a teacher, I would be willing to have      8.1311        6.273*        5.018         1.634
    an exceptional child in my classroom.
15. Classroom teachers should make the            191.891       66.139*       24.557        2.668**
    decision as to whether or not to take an
    exceptional child in their classroom.
20. As a teacher, I would be willing to take      11.510        11.842*       [illegible]   .672
    extra training so as to be able to better
    handle exceptional children in my
    classroom.
23. Teachers need extra training if they are      3.434         3.027*        3.562         1.305
    to teach exceptional students.


Discussion
Attitudes are not a unitary dimension and cannot be captured by a single score. 
Teacher attitudes are influenced by many factors such as information level, knowl- 
edge, attainment, specific skill acquisition, contact and experience, exposure, confi- 
dence, and amount of success. The degree to which each impacts on the formation 
and maintenance of attitudes is not clear. 


Obviously, a two-dimensional Attitude Survey, as used in this study, is limited 
and not exhaustive of the domain of attitudes. Moreover, sampling students from 
in-service programs inherently biases the results. There is evidence that students 
who enroll in special education programs hold more positive attitudes than those 
who enter other educational options (Semmel & Dickson, 1966). It is likely that in- 
terest in exceptional people constitutes a major feature when in-service students 
enter special education. 


While any implications should be approached cautiously, the results are never- 
theless suggestive. Canadian teachers in this study held attitudes that are generally 
positive and reflect the idea that, at worst, mainstreaming will not have a negative 
effect on special children or classroom teachers. On the other hand, the lower rat- 
ings of the Australian teachers would suggest that, despite efforts to promote main- 
streaming, negative attitudes are still held by many professionals. These bode 
poorly for the success of mainstreaming and can have serious implications for 
student success and achievement. However, in Australia provision is being made to 
educate exceptional children in regular classrooms. Teachers are expected to de- 
velop the competencies to teach heterogeneous groups of students. The policy 
changes are reflected in teacher education programs and, as more special education 
offerings are made available at the preservice and in-service levels, it is to be expected
that the degree of commitment to special education will increase. 


Note 


This project was funded by a grant from the Educational Research Institute of British Columbia.


This article critically examines contemporary schooling in rural Ghana. It notes the
problems that have arisen out of the superimposition of the school as a social institu- 
tion on the indigenous education that villagers practised prior to the former’s intro- 
duction. Arguing for a structural transformation of rural village education, an 
alternative educational model—referred to as “community education-as-an-enter- 
prise”—is proposed to replace the present schooling system. Based on the notion of 
indigenization of education, the model incorporates the principles of the unity of 
education and work, and the participation of the community in shaping the program, 
process, and practice of its own education and development. 


Adams and Coulibaly (1985) recently described the indigenous education systems 
of some West African village communities as being “community-based.” Similarly, 
Wilson (1963), Busia (1964), Scanlon (1964, pp. 1-12) and McWilliam and 
Kwamena-Poh (1975) have noted the community-oriented nature of indigenous 
education in the centripetally organized communal societies of Ghana and other 
African countries. Basically incompatible with the structure and mechanisms of 
formal education, this African indigenous mode of community education, and espe- 
cially its socially integrating qualities, has been lost in the transition to the West- 
ern schooling system. As a consequence of this loss, the village school finds itself in 
constant collision with its community (Roberts, 1982). The frequent confrontation 
between the two has severely curtailed a role for the school in community affairs, 
particularly in village development efforts. In fact, widespread dissatisfaction with 
the performance of the school among school leavers, parents, and successive Gha- 
naian governments is directly or indirectly linked to the village school’s inability to 
relate to the community and thus respond to its development needs. This inability
underscores the need to find ways of reviving the principles and practice of indige- 
nous African community education to better integrate the village community and 
its formal educational institution for the purpose of village development. 


This article proposes the revival of African community education as a means of 
regaining its benefits for the rural Ghanaian village. What follows is a suggested 
framework which incorporates the relevant elements of both schooling and indige- 
nous education and, at the same time, integrates education into the community as 
an inducement for the latter’s progress and well-being. It proceeds by briefly 
outlining the dysfunctions of Ghanaian village schooling. Then the compatible 
aspects of both schooling and indigenous education are welded together into a new 
framework or model called “community education-as-an-enterprise.” 


Failures of the School 

The dysfunctions of Western formal education in Africa are well documented by 
Manuwuike (1978). Other writers have criticized the African school for being a
domestication device (Freire, 1972), a tool of cultural imperialism (Carnoy, 1974), and
an agent of colonial domination (Altbach & Kelly, 1978). President Nyerere
(1967) views the school as fostering the cultural alienation of villagers, while 
Fafunwa (1973) regards it as a push-factor for rural-urban migration—a phenome- 
non which siphons the human capital of the village into the city, thereby widening 
the gap between the two in terms of the distribution of wealth. 


Studies which focus on schooling in Ghana confirm many of these assertions. 
The research findings of Weis (1979, 1981) and Williamson (1979), for example, re- 
veal tendencies toward the reproduction of inequalities through school stratifica- 
tion and the selection mechanisms of schooling. Foster (1965) had earlier reported 
disparities in rural and urban primary enrollment. From his findings, Blakemore 
(1975) links early attrition in village schools to parents’ frustrations over the lack of 
returns relative to their sacrifice in sending their children to school. Invariably, 
some parents withdraw their wards prematurely from school for this reason. And, 
based on evidence from her own study, Browne (1981) argues that the school is unable
to encourage in village pupils the internalization of skills that relate to
village production systems. These findings indicate limits to the school’s ability 
both to achieve universal education in Ghana and to contribute to Ghanaian village 
development. 


The school is criticized in other ways too. The high cost of formal education, 
usually a quarter or more of total government budget allocations, is a source of constant
criticism from politicians and bureaucrats. Parents complain about the loss of
investment in their unemployed school graduates, while the frustrations of the 
school leavers themselves become evident as soon as they realize that the high ex- 
pectations and tastes which they cultivated while in school are unattainable. Be- 
cause the school is divorced from work, it can also be criticized for making it 
impossible for learners to earn income, hence aggravating the individual’s earnings 
foregone during the unusually long Ghanaian schooling years. There is also a wide 
gap between in-school cognitive learning and out-of-school affective and practical 
learning. 


By implication, these basic problems of schooling (curriculum irrelevance, sepa- 
ration of learning from earning, high financial costs, faulty delivery methods, and 
structural incongruency), which are exacerbated in the village situation in Ghana, 
are the results of the isolation of the school from the rest of the village. It follows,
then, that for as long as the school is effectively estranged from its host commu- 
nity, it will continue to be unresponsive to the other problematic areas of village 
life, particularly the latter’s social and economic development. 


Educationists like Minzey and LeTarte (1979) suggest, within the Western educational
framework, that this problem of integrating the school into its community
so as to create a role for the school in its community’s development is achievable 
through community education. Indeed, the concept of community education, which 
is examined next, seems to articulate adequately the educational strategies that are 
important in efforts geared toward regaining the functional benefits of African in- 
digenous community education that once characterized Ghanaian village communi- 
ties. 


Conceptual Perspectives 

Both community education and community development, as contemporary educa- 
tional concepts, originated in Western industrial societies where schooling is often 
criticized for its role in fostering alienation—a preeminent social characteristic of 
those Gesellschaft societies (Ashcroft, 1975). The school is considered a dehumanizing
institution, an active partner in the creation of urban communities that lack
humanistic attributes such as caring, sharing, and self-help. This concern over the 
alienation of the individual from the community raises doubts about the relevance 
of the concept of community education in the conceptualization of education in 
rural Ghanaian villages. For those communities are characterized by strong kin 
bonds which facilitate a homogeneous community in which caring, sharing, and in- 
terdependence are natural attributes. However, these village communities experi- 
ence their own cultural and environmental alienation of the schooled minority who 
are often forced by their formal education to abandon the village for the city in- 
stead of staying to contribute to its development (Nyerere, 1967). The concept is 
equally relevant to Ghanaian rural education because it also professes educational 
relevance (Sinclair, 1980), mobilization for self-help projects, and the active partici- 
pation of the community in its own education. 


Community Education 
In their book Community Education: From Program to Process to Practice, de- 
scribed by Taylor (1984) as the blueprint of the mainstream community education 
movement in Western societies, Minzey and LeTarte (1979) consider community 
education not only as an educational concept but also as a movement, a philosophi- 
cal concept, a program, and a process. They define community education as: 
a philosophical concept which serves the entire community by providing for all of the 
educational needs of all its community members. It uses the local school to serve as a cat- 
alyst for bringing community resource to bear on community problems in an effort to de- 
velop a positive sense of community process toward the end of self-actualization. (p. 12) 
Jackson (1980) and Taylor (1983, 1984), among others, have criticized this in- 
terpretation of the concept. They question whether community education is really 
a radical panacea for the solution of the problems associated with the school, or 
whether it is, in fact, a meliorative device designed to placate the poor through 
token programs in order to divert attention from the real issues of educational in- 
equalities, inefficiencies, and ineffectiveness. These critics also see community edu- 
cation, when conceptualized this way, as more expressive of the peripheral nature 
of the “add-on” extracurricular activities that supplement the formal school
curriculum. They further argue that the role expectations of the school do not include
enhancing the achievement chances of the underprivileged. In fact, the Minzey and 
LeTarte definition implies school-centered community education and, although the 
protagonists of this school-based community education (including Minzey and 
LeTarte) will dispute it, the interpretation makes community education inter- 
changeable with “community schooling.” 


Nevertheless, along with other proponents of school-centered community edu- 
cation, Minzey and LeTarte also acknowledge in community education implicit ele- 
ments of educational relevance, a capacity to engender self-reliance, a propensity to 
encourage caring and sharing, opportunities for community mobilization for com- 
munity action, and built-in problem-solving potential in its process. It is these 
aspects of the concept, rather than its school-centeredness, which are considered in 
community education as it is conceptualized in this article. 


Community education is, therefore, defined here as an educational process that 
encourages the utilization of all learning resources within the community (even 
when there is no school), for the purposes of mobilizing the community’s resources 
for its own development. The definition reiterates Quainoo’s (1955) observation 
that school generated learning is only part of the total learning that occurs through 
the network of educational agencies in the village; and it underlines the need to
integrate the school into the community system. In addition, the interpretation
recognizes the need for a learning model which effectively converts the school system to
an education system (Fantini, 1978). Again, this conceptualization affirms the un- 
derlying need, noted by King (1976), to recognize the differences between the social 
conditions of Western industrialized societies where the concept originated, and 
those of the developing societies, like Ghana, where a village may not have a school, 
and where also the educational focus is on functional basic education. Finally, giv- 
en the adoption of a suitable educational delivery strategy such as Freire’s 
problematization and conscientization which favor a liberating approach to com- 
munity development, the definition incorporates the capacity of community educa- 
tion to engender community development, a term which is discussed below. 


Community Development 

The significance of the process of community education lies in its ability to effect 
qualitative social change in the form of seeking solutions to the community’s prob- 
lems (Minzey & LeTarte, 1979). Fletcher (1980) explains that this qualitative 
change may be achieved through programs of compensation or positive discrimina- 
tion by which a community education process may target a social problem such as 
poverty, alienation, or inequality of educational opportunity and tailor its programs 
toward the solution of the identified problem. 


Armstrong, David, and Doyle (1975) describe this approach, which they per- 
ceive as interventionist, as target-centered and they distinguish it from their pre- 
ferred man-centered alternative approach. In their view, the man-centered 
approach to community development is essentially an educational process, a view 
which ties in with community development as an integral part of community edu- 
cation. Community development is seen in this article in that perspective. The 
term is thus construed as the processes of a community’s self-examination through 
its education for the purposes of improving its living standards by providing its 
members with the necessary skills for the utilization and transformation of the 
available resources to its benefit. This meaning discards the notion of intervention
suggested by Fletcher and deemphasizes the role of external change agents. The 
definition instead makes room for education in the efforts to provide for the social 
advancement and well-being (Ross, 1967) of the village community. Thus, although 
community development is interpreted here from the Minzey and LeTarte self-help 
approach, the emphasis is on indigenously stimulated development and not exter- 
nally induced development. 


Community Education for Community Development 

Many views on community education for community development have been ex- 
pressed. Sinclair (1980) emphasized the need to relate the design of basic schooling 
to the life and work of the wider community. Dove (1980), Watson (1982), and 
Thompson (1983) see a role for a village community school in rural transformation 
within social conditions such as those prevailing in rural Ghana. They envision the 
village school as a rallying point for community mobilization and participation in 
change efforts that will engender improvements in the social, economic, and health 
conditions, among others, in the village. Watson (1982, 1983) also argues that there 
is a specific role for the village school teacher in village education and develop- 
ment. 


Writers such as these also note preconditions for the successful translation of 
these ideas into viable village education and development schemes. One such pre- 
condition is the active involvement of the village leadership in the processes of 
community education for development. Dove (1980) cautions that government and 
international agency initiated interventionist community education schemes, which 
are developed without the active participation of the local community leadership, 
risk failure. The conflict of authority between the school and village institutions 
described by Roberts (1982) underlines the serious consequences of ignoring village 
decision and opinion leaders in the planning and implementation of village commu- 
nity education and development schemes. Watson (1983) finds the present restric- 
tion of teacher training to the acquisition of methods for imparting literacy and 
cognitive skills exclusively to village school pupils inadequate preparation for a de- 
velopment inclined teacher. Therefore, he recommends that the prospective village 
teacher’s training be extended to cover community development skills if a leader- 
ship role is to be fulfilled. It is important to note, however, Roberts’ (1976) asser- 
tion that the teacher’s leadership role in the Ghanaian village, considered vital to 
village education and development by Watson, is diminishing with a continuing de- 
valuation of his social status. As important and as apparent as these factors might 
seem, British colonial attempts at village community education for development in 
Ghana did not appear to have recognized them. 


Colonial Community Education 

Mgbemena (1983) contends that British colonial educational policy, which centered 
around the school, was not particularly enthusiastic about tailoring schooling to 
suit local needs. Nevertheless, persuaded by the American initiated Phelps-Stokes 
Reports of 1922 and 1925 on education in Africa (Manuwuike, 1978, pp. 15-16), the 
colonial administration introduced some kind of “education for the welfare of the 
community” through programs outlined in policy guidelines such as the Education 
Policy in British Tropical Africa of 1925, the Memorandum on the Education of 
African Communities of 1935, and Mass Education in African Society of 1944. 
Implemented “symbolically” through extracurricular activities such as cultural
drumming and dancing ceremonies, the schemes did not achieve the expected in- 
volvement of school leavers in village development. Schooled villagers still flocked 
out of the village. Neither did the vocationalization of the school curriculum, by in- 
troducing subjects like gardening, mitigate the role of the school in alienation and 
educational irrelevance. For, as Foster (1966) predicted, mere hotch-potch adjust- 
ments and disparate uncoordinated changes to curriculum content in the manner 
of vocationalization were not enough to make schooling more meaningful to Gha- 
naian village communities. 

The failure of British efforts to link school (curriculum) to life in the village 
attests to the inability of the school to relate to the village and contribute to its de- 
velopment in contemporary times. This is not to say that education (not schooling) 
cannot play a positive role in village development. The inability of the colonial 
educational efforts to induce village development only proves the validity of the 
preconditions referred to earlier and also serves as a guide for future attempts at 
community education for village development—a search which is the objective of 
this article. 


Informed by the inadequate performance of the British in community educa- 
tion, the search here for a new approach to community education for village devel- 
opment focuses on a less formally structured education that marries school and 
work, integrates learning and life, is delivered through dialogical discourses, and 
thus creates an enabling role for education in village development. Indeed, an ex- 
amination of Ghanaian village conditions, briefly attempted in the ensuing section, 
would suggest that even a more radical approach to community education induced 
village community development is necessary. 


Characteristics of the Village Community 

The village community in Ghana is almost a prototype of the hypothetical 
Gemeinschaft. It is also close to Bell and Newby’s (1971) typology of a small, iso- 
lated, homogeneous group of people with a strong sense of group solidarity; but not 
a nonliterate population in the Ghanaian case because, as Roberts (1976) discov- 
ered, every village is guaranteed the presence of some elementary school leavers as 
a consequence of the massive expansion of schooling facilities in the country since 
1951. 


More explicitly, the Ghanaian village community today is a geopolitical unit 
with a group of people living in the same place, with common values and history, 
bound together by a multiplicity of economic, social, and kinship ties (strong enough
to the point of mechanical solidarity), and many of whom are descendants of 
people who also lived in that place.’ 


The dominant economic activity in the village is small-scale farming or fishing 
on an owner-operator basis. All other economically oriented activities, including 
herbal medicine practice, are part-time off-season supplements to the major in- 
come from agriculture. 


Politically, the village is a unit in the traditional centripetally organized com- 
munal system of an aggregation of villages called a paramountcy. The village is au- 
tonomous in its internal politics (Roberts, 1976). It elects its own Chief or Odikro 
who, together with a Council of Elders, constitutes the highest authority in the flat- 
topped pyramid village political structure. 
However, the village authority does not exercise jurisdiction over the school
(staff and pupils)—wherever it exists—following its geographical and social isola- 
tion deliberately created by the original architects, the missionaries, to prevent 
“uncivilized” village practices from contaminating the “purity” of the school 
(Zarembka, 1969). 


Education in the Village 

The school, where it exists, is only part of the village education system and only 
one factor among the needs of villagers (Quainoo, 1955). Far from achieving univer- 
sal literacy, in most cases, a majority of villagers acquire the skills that enable them 
to participate actively in the village and national economy through the indigenous 
education system.’ 


Villagers distinguish between nimdee (knowledge) and nyansa (wisdom). The 
former refers to skills or hard facts, routine arrangements of facts, and the applica- 
tion of systematic methods to resolve a problem or perform a task, often in con- 
junction with physical power. This knowledge may be innate, endowed, or 
deliberately taught and learned, and is universally accessible. Among professionals 
such as healers, blacksmiths, hunters, and linguists, however, nimdee is deeper and 
specialized, requiring extranormal aptitude and methodology in its acquisition. 


Nyansa is a rare, higher order, and more complex knowledge. It is partially ex- 
periential, but not everyone has the ability and capacity to aspire to achieving it. 
Although to some extent gerontocratic, it is not necessarily synonymous with age. 
The skills of nyansa are important in adjudication and political decision-making. 
The typical wise person is the linguist, an orator and an authority on the local lan- 
guage and the intermediary in communication between the Chief and his or her au- 
dience. 

Due to exhaustive investiture rites and procedures, political officeholders are 
generally deemed wise. There are no institutionalized forms of restricted knowledge 
or wisdom acquisition. In the formal school systems of Africa inherited from the 
colonial era, however, this folk wisdom “is conspicuous by its absence” (Adams & 
Coulibaly, 1985). 

Either nimdee or nyansa may be acquired or transmitted through a combination
of family, peer group (age grade), asafo companies (militia), or story-telling,
which together constitute the network of community education. Each institution, as
outlined by Bentsi-Enchill (1971), is assigned a specific role and function in the 
child’s education and upbringing. Parents pass on the basic skills for economic 
survival to their offspring who learn through play, imitation, and role-playing. 


Age grade or peer group maturation confinement sessions (akin to the school’s 
formality) prepare minors for the challenges of adulthood. The asafo company in- 
culcates into boys the ideals of sacrifice, courage, community solidarity, and re- 
sponsibility. Special lessons in local history, oratory, traditional and customary 
rites are always given to Chiefs- and Queen Mothers-elect to provide them with
the tools for wise counseling and leadership. Anansesem, or story-telling, teaches 
morals. Village assemblies and festivals provide occasions for the exchange of ideas 
and knowledge in local and extravillage discourses on topics like new farming tech- 
niques or genealogies. 

It was this elaborate, well organized education system, referred to as indigenous 
education, that the school came to subvert rather than supplement, complement, or 
even coexist with. The school has, in fact, thrived on its intrinsic contempt for the 
social and cultural institutions of the village, even though this is unnecessary in
light of the fact that the two systems have identical objectives of preparing commu- 
nity members for life. 


Integrating Indigenous Learning and Schooling 

Indigenous modes of learning and schooling are complementary in many respects. 
Villagers need the techniques of reading, writing, numeracy, and recognition, which
are essential for the calculations and other daily economic activities they engage in.
But these skills, which are basic tools for the acquisition of the highest order of 
knowledge and are taught by the school, are missing from indigenous African edu- 
cation (Busia, 1964). 

Learners, however, equally need their education to be related to their environ- 
ment in the manner guaranteed by indigenous education, to enable them to control 
the latter rather than abandon it. Wilson (1963) identifies another of the positive 
elements of indigenous education, its methodology, which he believes to be essen- 
tial for the achievement of the powers of memory and learning. McWilliam and 
Kwamena-Poh (1975) lay partial blame for faulty teaching methods in African 
classrooms on the teachers’ inability to incorporate indigenous methods of remem- 
bering and recalling into their teaching strategies. Majasan (1975) is also optimistic 
about the enhancing capabilities of indigenous delivery methods in contemporary 
teaching. Adams and Coulibaly (1985) identify positive concepts related to the
objectives, methods, and learner characteristics such as active pedagogy, productive
pedagogy, and integrated learning from the traditional community-based education 
in Ivory Coast, Guinea, and Mali which can be borrowed by contemporary African 
education systems. These examples, although few, indicate a basis for an integrated 
indigenous and school learning. 


The search here is, however, not for a colonial type nominal or symbolic incor- 
poration of a few isolated components of indigenous education into the mechanisms 
of schooling. It is for a basic structure of education that derives from indigenous 
educational structures and institutions which are community-based rather than 
from the school. The model of community education that engenders community de- 
velopment to be considered here, therefore, entails a revival of the philosophy, ob- 
jectives, tasks, methods and the general modality of indigenous Ghanaian 
education in response to Bentsi-Enchill’s (1971) challenge to Ghanaian and African 
educationists to develop modernized versions of traditional education. In a sense, 
the model-building may also be considered an attempt at the indigenization of 
Ghanaian education (Ansu-Kyeremeh, 1985). 


The Model 

Philosophy for Village Education for Development 

Among general thoughts on community-based education considered here are those 
of Dewey (1976), Freire (1970), and Illich (1971). John Dewey’s concern for a func- 
tional education that trains the child to respond to the challenges of his environ- 
ment and participate actively in community effort to meet these challenges is 
extant in his writings. Of interest here, though, is his stress on the need to relate 
education to the immediate surroundings of the learner.⁴


Paulo Freire’s (1970) view that education must recognize the fact that man as a 
being of praxis is capable of assessing his environment and reacting to it through 
action and reflection, in the way of positive utilization of the resources available 
within it, is equally relevant to community-based education. Freire believes that
the extent of reaction in the above circumstances will usually depend on the degree 
to which man is aware of his potential. He further posits that the desired awareness 
is realized through problematization and conscientization from a dialogical method 
of education that guarantees the active participation of learners in their own edu- 
cation. 


Ivan Illich (1971) proposes the disestablishment of the school, a proposal which
has been criticized as unrealistic (Dore, 1976). Yet the alternative of “learning 
webs” Illich suggests as a replacement for the school seems to fit well structurally 
into the village community education-as-an-enterprise formulated in this article 
which seeks to redefine the concept of school so that education (general and prac- 
tical) is made possible in villages without schools. 


These ideas are considered together as the philosophical basis for the design of 
village education that mirrors the structure and ideals of the village and serves the 
purposes of its community. 


Objectives of the Proposed Model 

It is from the basic tenets underlying the above philosophies of education that the 
principal objectives of the community education model under discussion here are 
derived. The tenets are: that education must relate to the environment and the
needs of its clientele;⁵ that rigidly structured learning processes are incapable of
creating a congenial learning atmosphere within the community typology being 
dealt with; and that the active participation of the community in the design and 
implementation of an education scheme for the benefit of its members is of para- 
mount importance. It is discernible from the earlier discussion on the elements of 
community education and community development that these are necessary ingre- 
dients in education designed to elicit development in the autochthon community. 
The tenets also underline the task for the model which has an overall objective of 
organizing a learning atmosphere in the village, which makes universal functional 
education for children by the age of maturity manifestly feasible, while offering op- 
portunities of access to basic education for adults in the short term. Similarly, they 
express the other objective for the model which is uniting education and work so 
that learners can simultaneously earn a living. 

The proposed model is also designed to transform the community by reshaping 
the consumption patterns of villagers to accommodate and stimulate patronage for 
locally produced goods and services. By implication President Nyerere’s (1967) idea 
of education for self-reliance is an equally attractive basis for the model. 

Furthermore, any progressive village community education system must aim at 
reversing the present domination of village life by the concept of school which pre- 
vails even in villages without schools. It must equally ensure greater interaction of 
learners and instructors through a two-way communication flow in which exchange 
of ideas benefits learners and exposes instructors to the problems of learners. 

Also, since the thoughts and actions of people are usually shaped by the social 
contexts in which they grow up and live, the new approach to the education of 
villagers must incorporate the coexistence of competitive individualism and cooper- 
ative collectivism, a feature of the traditional village community which is 
atrophying and may probably die if not salvaged soon. The kind of educational 
structure that will effectively pursue these aims is described next. 


Structure of the Model

To achieve the above objectives, the model is loosely structured to allow greater 
geographical and subject mobility of learners and to offer them the opportunity to 
try everything on the curriculum. In theory, it is a 22-year program, including a 
one-year compulsory full-time work period. In practice, however, it is lifelong edu- 
cation and it permits flexibility in advancement from one stage to another. Except 
for the one-year period of compulsory work, part-time work is required of all full-time
learners from age 10 until they are ready to work full-time. The deschooling
structure is justifiable because, as Foster (1966) notes, the schools will detach chil- 
dren from the rural environment largely irrespective of what they teach. 


The structure (see Figure 1) is five-tier with each tier known as a “level.” Level 
One begins at age two and ends at age eight. There are two subsections within the 
level. The first section, the equivalent of the present nursery and kindergarten, 
commences with two-year-olds and it is of four years’ duration. Play as a learning 
methodology is emphasized at this stage, and in addition the child is introduced to 
personal hygiene. At the preparatory stage, the second section of Level One, help- 
ing children through reading, writing, and numeracy is the major cognitive activity. 
Children are also encouraged to accompany their parents on visits to their farms or 
to other work places to engage in experiential and affective learning. This will help 
solve the widespread problem of attrition caused by parents’ forcible withdrawal of 
their children from full-time learning so that they may help their parents on their 
farms (Kabba-Diallo, 1983). 


Age eight is the entry point for Level Two education, which takes the child 
through eight years of study to the first exit point in the system at age 16. 
Together, Levels One and Two constitute the basic and compulsory segment
of the community education-as-an-enterprise model. Thus the main preoccupation
of village education becomes the ultimate conversion of two-year-olds into
literate and educated members of the work force within a period of 14 years. At
least four of the eight years of Level Two education are to be spent in the village
by urban-born and urban-domiciled children.


By the time the child graduates from Level Two, the expectation is that he or 
she ought to have gathered enough theoretical knowledge and practical skills to 
enable him or her to earn a living. The 16-year-old thus proceeds to undertake a 
mandatory one-year period of work at the end of which he or she has the option to 
enter tertiary education for specialized learning. Individuals who successfully com- 
plete the basic education and opt for Level Three, specialized learning, are automa- 
tically guaranteed places in the tertiary institutions. Those staying on in the work 
force must enroll in a part-time Level Five program. The rationale behind this ar- 
rangement is to encourage the affected individuals to keep their minds active, 
maintain their aptitude for learning, and reduce loss of memory, in case they de- 
cide later in life to return to full-time Level Three studies. 


For full-time post-Level Two education aspirants, learning resumes at Level 
Three at the end of the one-year compulsory work. Third Level learning is tailored 
to the needs of middle level manpower training similar to the semiprofessional 
preparation presently undertaken by polytechnics and teachers’ and nurses’ col- 
leges. Despite being the foundation for Level Four academic education, to forestall 
future dropout problems, all Level Three learners ought, as a matter of policy, to 
specialize in a vocation. The fact that “learning clusters” or institutions at this level 
emphasize specific areas of study might compel individuals to relocate geographically.
Part-time work, which is built into the entire education system, is particularly
important for the practicum or on-the-job learning aspect of the Level Three
curriculum.

Figure 1: Structure of the Village Community Education-as-an-Enterprise Model

Evaluation at this juncture, as in earlier cases, will continue to be based on the- 
oretical learning, practical skills, and learner contributions to socially useful work 
and economically productive ventures (Nyerere, 1967). This will ensure that meas- 
urement of competence or attainment is real and corrects the current practice of 
misrepresenting certificate examinations as selection examinations (Quansah, 
1985). Manual labor is considered an inseparable part of the work-study program 
that the model envisages as a means of inculcating the dignity of menial tasks into 
every member of the village community and as citizens of an agricultural nation 
with a labor intensive economy. 

The terminal point of Level Three learning coincides with the second exit point 
in the system. Level Three graduates may, therefore, choose to conclude full-time 
studies. But some may wish to advance to Level Four academic and professional 
education, having concurrently been prepared for it at the third level. As in the 
case of Level Two graduates, those terminating their full-time studies at this point 
must enroll in Level Five study. For Level Three education, then, the average 
learner enters at age 17 or older, and completes at age 20 plus. It must be empha- 
sized that all ages stipulated here are approximations because the individual may 
progress through the various levels freely and in accordance with his or her apti- 
tude for prolonged learning, ability to learn, and degree of motivation. 


Officially, full-time study in the proposed system terminates at the end of Level 
Four. Level Four learning is the equivalent of contemporary university education. 
Learning tasks at this level are to be restructured to include a work program and 
an indigenized curriculum that responds more positively to the needs of a nation 
with a 60% rural population and which relies on its village communities for its 
political and economic strength. The duration for Level Four education is also four 
years. The entry age is around 21, while the age of completion is 24 or thereabouts. 
Mandatory part-time or intrastudy work, to be undertaken in the village, is an es- 
sential part of the learning program at this Level. 

Learning at Level Five is informal, continuous, and lifelong. Any learner termi- 
nating full-time learning at any of the three exit points automatically joins the 
mandatory Level Five learning activities. These activities, to reflect the informal 
learning that is encouraged at this Level, are in the form of discussions and 
debates, and they are undertaken through clubs, workshops, or assemblies. The 
deployment of urban dwellers to work on village development projects, at least 
once each year, is a mandatory component of the curriculum; and so is the active 
participation of village residents in communal labor. The majority of activities, 
however, focus on personal growth, self-improvement, self-fulfilment, and self- 
actualization. Discussions on leadership, leadership roles, and national and interna- 
tional issues are also regularly organized. 


Essentially, then, the program, process, and practice of the proposed education
model are a fusion of indigenous education and schooling. It merges the most enriching
aspects of formal, nonformal, and informal modes of education. To achieve its
expected learning goals, a curriculum of interwoven disciplines is desirable. The 
elements of this curriculum must complement each other and also have direct rele- 
vance to the educational needs of the village communities in particular and the na- 
tion at large. This curriculum must incorporate the global dimensions of the 
learning process as well. 


Curriculum

A community-oriented curriculum responsive to the challenges posed by this new 
model demands a revolutionary and unorthodox approach to curriculum design. 
Four areas that immediately come to mind as natural components of such a curric- 
ulum are: theory (cognitive learning of subjects or disciplines), practicals (experi- 
ments, participatory observations, and workshops), economic productivity (work), 
and voluntary labor (communal labor). The curriculum is so formulated to coordi- 
nate and integrate the cognitive and affective aspects of learning, a feature that is 
missing from today’s fragmented modes of learning, especially the type that ob- 
tains in rural communities. The economic productivity and voluntary labor constit- 
uents of the curriculum are needed to engender community development. 


Regarding subject organization, the basic community education stage (Levels 
One and Two) is concerned with seven area studies of groups of subjects. The seven 
broad areas are: environmental studies, physical education, mathematics, science, 
agriculture, crafts and craftwork, and languages. Branches of environmental study 
include health, economics, culture, geography, society, and technology. Physical 
education embraces sports, military training, asafo (local militia) adventures, and 
maturation exercises. Elements of the subject of mathematics are numeracy, count- 
ing, measurement, reducing figures to writing, gathering and recording statistical 
data based on local phenomena, and accounting. Mother tongue education is to be 
given priority. The local language is taught side by side with the English language 
as a subject, with its grammar, composition, creative writing, poetry, and oratory 
aspects thoroughly dealt with. 

This proposed curriculum outline is only a guideline designed to assist the local 
community to determine its own. The uniqueness of local village conditions and 
needs will dictate which components of the curriculum should be emphasized. 
However, it is important for the adopted curriculum to also reflect Gandhi’s view 
that “there can be no subject or study which cannot be correlated to craft or environment”
and that “a teacher who cannot correlate his lesson to craft or environment
is useless to school.”⁶ The composite of a curriculum with traditional and
modern inputs will also facilitate the retrieval and preservation of several compo- 
nents of the indigenous culture before they disappear completely. 


Methods of Delivery 

An unconventional education system must be executed unconventionally to avoid 
an entanglement of the new with the old. Just as Machoya (1980) found it paradox- 
ical for a deformalizing school to adopt conventional patterns of structured and 
graded written curriculum, the basic principles underlying the village community 
education-as-an-enterprise can be lost in the confusion of applying conventional 
methodology. Delivery strategies that correspond to the quasi-formal education 
model, with a substantial indigenous structural representation, are more likely to
agree and mesh with the new learning and teaching methods that are a blend of re- 
trieved endogenous methodology and flexible modern approaches. The delivery ma- 
chine is thus built from dialogue, discussion, and inquiry with minimum use of 
didactic instructional methods. 

Unlike the present system, the medium of instruction is the local language of 
the community for the duration of the basic education (Levels One and Two). 
During this time, English will be taught as a subject. At Levels Three and Four, 
English becomes the new medium of instruction. A reversion to the local language 
occurs at Level Five. 


Organization

Underlying the organizational structure of the community education-as-an-enter- 
prise model is the Freirian idea that the participation and control of its own educa- 
tion by the community will awaken the consciousness of its people to their 
environmental problems and stimulate their solution. Problem-solving is an inte- 
gral part of its methodology. But, to facilitate the active participation in its educa- 
tion organizationally, a Village Education Conference is to be institutionalized to 
ensure a community-oriented educational decision-making process (see Figure 2). 
Representatives of all actors in the new education are to serve on the Conference. 
These include learners, learner-leaders, local instructors, and district educational 
representatives. The functions of the Conference are to design the curriculum, re- 
cruit learner-leaders, provide the learning infrastructure, and identify and mobilize 
all the human and material resources within and outside the village for its educa- 
tion and development. The Conference also determines the evaluation criteria in 
conjunction with national authorities as a measure of giving village education a na- 
tional dimension to preempt the possible development of xenophobic communities. 
District, regional, and national levels of the Conference are, to this end, to coordi- 
nate the activities of the villages. 


Implementation 

Education systems are generally not easily amenable to change. Entrenched elite 
interests which are manifested in school stratification, rigid selection, and stiff 
entry requirements, are often threatened by radical and revolutionary education re- 
forms of the nature being proposed here. The privileged reject any moves they con- 
sider to be capable of neutralizing the advantages enjoyed by their children over 
their village counterparts and strive to protect their interests and maintain the 
status quo. In their resistance, they are assured of the support of the civil service 
bureaucracy which they dominate, conservative academics, and the teachers’ union 
which might consider the new learner-leader concept as a threat to teaching jobs. 


Nevertheless, the criticism against the present system, particularly its alleged 
propensity to reproduce inequalities in the social system (Weis, 1979, 1981), may be 
interpreted as a latent desire for an alternative education that addresses the 
inadequacies of schooling. The most disadvantaged under the prevailing system, 
the rural communities, are the most likely to support the reorganization of educa- 
tion, granting the consent of the local leadership (Dove, 1980). Whether enough 
public support can be induced to overcome the powerful political, economic, and 
social forces of the dominant elite and pave the way for the implementation of the 
proposed model, is another issue. 


Events that immediately followed the December 31, 1981 coup d’état point to 
the feasibility of a radical educational reform. High-ranking government officials 
attacked the current system of formal education as inefficient and ineffective, often 
singling out its elitist character for condemnation.⁷ The model even finds precedent
for some of its aspects in attempts to integrate kwasafo adwuma or communal
service into the school curriculum in the New Structure and Content of
Education for Ghana of 1974. Thus, as impractical as the village community 
education-as-an-enterprise model might seem, the developments noted above are 
indicators that obstacles confronting its adoption as a national education policy 
and system for Ghana are not insurmountable. It must be acknowledged, however,
that the extent and strength of vested interests in the present system, as demonstrated
in the short-lived and ineffectual nature of the innovations in education attempted
by the ruling military government in its incipient months,⁸ may impede its
ever advancing beyond the conceptualization stage. 

On balance, however, the availability of a variety of resources within the rural 
communities that are favorable to the new model is an incentive in itself for the 
model’s consideration for implementation. Such resources include the presence of 
knowledgeable elders and skilled artisans, and elementary school graduates who ac- 
cording to Roberts (1976) are present in every village including those villages with- 
out their own schools. These people together constitute a potential human resource 
base for learner-leader (teachers in the new model) roles under the proposed 
model. Logistical support can also be expected from the school buildings scattered 
in villages across the country. Regarding funding, villagers are well known for their 
enthusiasm in contributing either in cash or through communal labor toward the 
education of their children. In fact, the new model’s income generating capacity 
(arising out of the production aspects of its curriculum) tends to make it self- 
supporting and in accord with current trends toward budget restraint. 


Conclusion 

An attempt has been made to highlight some of the well known criticisms against 
schooling in semitraditional rural communities that have led to a call for its dises- 
tablishment. In response, the author has proposed what he considers a more viable 
alternative education to schooling, one which aims at indigenizing education in 
order to meet more effectively the educational needs of contemporary rural Ghana. 
It is believed that should the proposed model, the village community education-as- 
an-enterprise, win enough support for its implementation, a consistent effort to 
articulate and manipulate the model is likely to make Ghanaian education more 
relevant, more productive, cost-saving, and universal. And with its enabling fea- 
tures of a pedagogy of production and its problem-solving approach, it should have 
a positive impact on village community development. 


One needs to stress, however, that the model cannot be a panacea for the com- 
plex problems that confront the educational systems of Ghana and other African 
countries which share the predominant ruralism of the Ghanaian society. 
Nonetheless, it serves as a reference point for response to the challenge to Gha- 
naian educators by Bentsi-Enchill (1971, p. 13) to make universal basic education 
possible by providing modernized versions of education devices developed from the 
indigenous types. 


The exercise may thus be considered an effort at the institutionalization of au- 
thentic mass-oriented, common education that creates no outsiders or intellectual 
orphans (illiterates, neo-literates, and semiliterates) any longer but guarantees both 
adult and child fee-free basic education. 


Notes


1. One of the immediate actions of the first Ghanaian-controlled government, which was elected in 1951, was the introduction
of an Accelerated Education Plan, the implementation of which increased school enrollments dramatically within 10 years.
For example, during the period, primary school enrollment jumped by 211.9%, secondary and technical schools by 437.8%,
and university by 478.8%. See Kwame Nkrumah (1968), p. 77.


2. Definition in a paper prepared by a Working Group in the Department of Education in Developing Countries, Institute of
Education, University of London, entitled “The practice of community education in Africa,” in Kenneth King (1976), p. 97.


3. Though formal education has existed for some 400 years, the adult literacy level is between 30% and 40%.


4. See, for example, John Dewey (1916), p. 12.


5. The current head of the military government, Flt. Lt. Jerry J. Rawlings, is reported to have lamented that: “Ghanaians have
become victims to their environment instead of controlling it.” West Africa, No. 3401, October 11, 1982, p. 2677. 


6. Quoted in S.G. Weeks, (1976), p. 2. 


7. Opposition to the present educational system has come from highly placed people such as Mr. Kwamena Ahwoi, who is reported
to have announced the “Government's intention to abolish private preparatory schools as soon as possible to give
equal educational opportunity to children of both rich and poor.” West Africa, No. 3458, 21 November, 1983, p. 2707. The 
schools referred to are generally perceived as elitist. 


8. Examples of these innovations include an “Education for National Development Plan,” which envisaged the establishment 
of “community secondary schools” to involve students in productive labor; as stated in the article, “New frontier of hope,” 
by Newton Amedofu in West Africa, No. 3403, 24 October 1982, pp. 2765-2766. 


In this article, the authors suggest that educational research continues to be dominated
by a particular mode of thought, that of rational empiricism and its adherence
to a scientific view of the world. They draw upon the work of Jürgen Habermas (1971)
to establish a framework of analysis for their claim. Alternative paradigms of how we 
view the world are outlined and related to three different inquiry orientations: empirical
analytic, situational interpretive, and critically reflective. The nature of educational
research as viewed from these different orientations is then described. Finally,
an attempt is made to bring forth the notion of complementarity as a means of 
achieving a fuller understanding of education and research. 


This article will focus on the notion of how our world view determines how we per- 
ceive educational research. It begins by examining the framework developed by 
Habermas and how his ideas can be extrapolated into curriculum (Aoki, 1978). The 
article then tries to show how these ideas can be further developed to reflect vari- 
ous stances toward educational research, and pursues the notion of 
complementarity. 


It is our contention that education, in particular Western education, has been 
dominated for decades by a particular mode of thought, that of rational empiricism 
and adherence to a scientific view of the world, which has led to a rather narrow in- 
terpretation of education on the part of both teachers and students. There exists, 
however, the possibility of widening our horizons and going beyond our currently 
accepted definitions of education. It is this expansion of horizons—this going be- 
yond—which is explored herein. 


As a vehicle to help us explore possibilities, we will investigate some of the implications
of the ideas of Jürgen Habermas for education. Habermas (1971)
suggests that knowledge cannot be separated from human interests, and therefore 
the underlying assumptions which we hold determine how we come to know the
world. Because these ways of knowing the world are rooted in all humans’ activities 
of work and interaction (including teaching and learning), it is useful to contem- 
plate the potential which these alternative orientations of person/world relation- 
ships offer for deepening our image of schooling and educational research. 


Paradigms serve as frameworks within which groups of investigators may work. 
A paradigm provides the boundaries for inquiries supplying “the group with pre- 
ferred or permissible analogies and metaphors” for determining what will be ac- 
cepted as an explanation and as a puzzle solution (Kuhn, 1970, p. 184). In his 
writings, Patton (1975) provides the following description regarding paradigms: 

Paradigms are normative: they tell the practitioners what to do without the necessity of 
long existential or epistemological consideration. But it is this aspect of a paradigm that 
constitutes both its strength and its weakness—its strength in that it makes action possible, 
its weakness in that the very reason for action is hidden in the unquestioned assumptions 
of the paradigm. (p. 9) 

Kuhn (1970) has described the power of paradigms when he states that “Paradigms 
may be prior to, more binding, and more complete than any set of rules for re- 
search that could be unequivocally abstracted from them” (p. 46). 


It becomes apparent that, as educators, it is important for us to uncover these 
unquestioned assumptions if we are to consider the merits of alternative para- 
digms. The adherents to a particular paradigm have acquired a particular predispo- 
sition to viewing the world which serves to filter out seemingly irrelevant events. 
Thus a paradigm provides a perspective from which reality is defined, interpreted, 
and acted upon. A paradigm is much more than a methodological orientation, for it 
provides an orientation for our relationship with the world, a world view. 


Habermas’ Tri-Paradigmatic Framework 

Habermas (1971) has developed a taxonomic structure which allows us to analyze 
the logical structure and underlying assumptions of what he has identified as three 
types of human activity, determined in large part by the interests of the 
adherents to the paradigm. If Habermas is accurate in his taxonomic structure, 
then it becomes obvious that science is not neutral, nor objective, as is commonly 
believed. Polanyi (1973) has attacked the notion of detached scientific objectivity 
and has claimed that knowledge is a union of that which is both objective and per- 
sonal. Scientific knowledge, for Polanyi, is not detached and impersonal, but rests 
ultimately on belief. The notion of scientific knowledge as impersonal, objective, 
and neutral is a fallacy. This stance is also supported by Medawar (1969) who 
claims that innocent, unbiased observation is a myth. 

A significant number of educators, in particular those interested in curriculum 
theorizing, have become increasingly critical of the dominant paradigm in our 
Western industrialized society, that of strict science with its emphasis on control, 
certainty, efficiency, precision, predictability, and standardization. It is our conten- 
tion that curriculum in general has adopted a technical approach to development, 
implementation, evaluation, and research that has narrowed our view as to what 
education is. School programs are too often presented as a form of reconstructed 
logic (Magoon, 1977) which supports an epistemology separating knowledge from 
the knower, ahistorically and without reference to the social interactions that led to 
its creation. Later in the article we will elaborate on this point and propose alternative 
views of education that, if presented to students, should enable them to 
develop deeper understandings. 

Aoki (1978) has taken the three forms of knowledge (called strict science, hermeneutic 
science, and critical science) based on differing fundamental interests as 
proposed by Habermas and expanded these into the area of curriculum. His first 
category, the empirical-analytic inquiry orientation, has as its basis the seeking of 
technical and explanatory knowledge. Power (1976) has labeled this perspective the 
agricultural-scientific paradigm. Aoki’s second orientation is based on a situational- 
interpretive perspective in which the meanings given by people, in particular 
contexts, are the focus of attention. In the third orientation, the critically-reflec- 
tive, underlying interests and assumptions of social action are considered. 


Empirical-Analytic Inquiry Orientation 

Person/World Relationships 

This orientation leads one to approach the world objectively with the primary in- 
terest of controlling events and objects. Humans view themselves as entities sepa- 
rate and distinct from their environment with the potential to act upon the world, 
to transform it, and to explain events with certainty and predictability. It is “with- 
out doubt the dominant one [orientation] in education research communities 
throughout North America” (Aoki, 1978, p. 58). 


The dominance of this orientation is well illustrated by Hanson’s (1985) classifi- 
cation of the articles published in the Alberta Journal of Educational Research 
over a five-year period (1980 to 1984) which revealed that “the majority of articles 
are quantitative in nature and rooted in positivist epistemologies” (p. 234). 


This orientation, or paradigm, attempts to explain how things work and to pre- 
dict the features of phenomena still to occur. The formulation of laws to make such 
predictions is dependent upon the identification of relationships (cause and effect) 
through observation and manipulation of events. 


This orientation allows for the study of humans and their natural world inde- 
pendent of the social interests of the researchers or of those researched, since real- 
ity is accepted as given and existing apart from humans themselves. Objectivity is 
assured, as the knowledge gained is based on observation and experimental evi- 
dence and is therefore not subject to bias (the neutrality of scientific knowledge). 


The technological society in which we live has bestowed on scientific methods, 
in particular the Baconian notion of induction and logical positivism, and Cartesian 
rational thought resulting in a hypothetico-deductive approach, the status of being 
the method for inquiry in a broad spectrum of fields. It is this uncritical acceptance 
of a way of thought that led Apple (1975) to state that “scientific outlooks have be- 
come so ingrained in our consciousness that they have become values, not merely 
ways of gaining knowledge” (p. 123). 

This approach has also led to the separation of theory and practice with the for- 
mulation of an hypothesis, as a component of theory, to guide the practice of an in- 
vestigator who believes that the theory illustrates an ideal state of affairs. The 
hypothesis can then be subjected to verification, frequently in the form of empiri- 
cal testing, deducing that if the hypothesis is correct one may be able to predict the 
occurrence of certain observable events. 


Within this orientation there is a great concern for the validity and reliability of 
the knowledge gained. For this reason, statistical analysis and corroborative empiri- 
cal evidence are often drawn together. 

Educational Research 

Within the empirical-analytic orientation, research is viewed as constructing a the- 
ory which represents an ideal state of affairs in a classroom and then using this 
theory to guide practice by predicting the effect of certain teaching behaviors on 
specific outcomes, for example, achievement. Novak (1979) indicates that the 
search is for a theory with the potential for “guiding the selection of (science) edu- 
cation events for study, types of assessment measures to use, alternative ways to 
transform data, and interpretation of relevant knowledge claims” (p. 485). 


Once a researcher has achieved an acceptable level of fit (discrepancy model) 
between the data observed and the categories of theory devised, the problem then 
becomes a technical one—to work out an efficient method of applying the theory to 
obtain the ideal state. A major goal of educational research becomes a matter of 
settling the question of “how the instructional sequences of concepts, skills, and 
facts should be organized to optimize for a given student his rate of learning” 
(Suppes, 1974, p. 8). 

The researcher within this orientation sees a classroom existing as a set of variables, 
and by using sophisticated statistical techniques attempts to explain and predict behaviors 
in terms of these variables. There is concern for both the validity and 
reliability of the theory employed with corroborative evidence being sought for the 
former and statistical provisions made for the latter. This emphasis of current edu- 
cational research exemplifies the technological approach to research which often at- 
tempts to find the “best” means to achieve a set of predetermined ends. This 
empirical-analytic paradigm which has been dominant in educational research for 
so long led Campbell and Stanley (1963) to state that we must instill into young 
experimenters the notion that we must “justify experimentation . . . not as a panacea, 
but rather as the only available route to cumulative progress” (p. 3). 

Lowery (1980) supports this notion that (science) education research has been 
dominated by an empirical-analytic paradigm when he writes that educational re- 
search was developed during a time of considerable scientific enterprise with a re- 
search paradigm based on the scientific paradigm. Lowery also notes that the effect 
was to apply methods from the pure sciences to a different area of endeavor, educa- 
tion, which lies within the parameters of social science. Perhaps it is time to recog- 
nize that education is concerned with the “world of people” more than with the 
“world of things.” 


Situational-Interpretive Inquiry Orientation 

Person/World Relationships 

This orientation views humans as social beings possessing intentionality leading to 
action (acting upon the world), and giving personal meanings to situations by inter- 
preting events around themselves. Humans are not viewed as separate from their 
world, as in the empirical-analytic orientation, but are viewed as influencing and 
being influenced by their surroundings. Each individual gives meaning to his or her 
particular situation, and these meanings may vary from one individual to another 
according to their interpretations. Aoki (1978) states that while “the human activ- 
ity of concern within the empirical-analytic orientation is man’s productive intel- 
lectual capacity to work, the activity of concern for those in the situation- 
interpretive framework is communication between man and man” (p. 60). 

From the situational perspective then, human efforts are directed toward gain- 
ing insights into human experience by means of clarifying and making explicit the 
meaning of events and activities. Subjective experience of life, of everyday reality, 
becomes the focus of attention. 

Most of the knowledge of the scientific disciplines is socially derived and so- 
cially approved by the scientific community. If the usual tradition of research is 
altered, if the community shifts its ways of knowing, then members of the commu- 
nity must be “reeducated” (Kuhn, 1970, p. 112). In his writing about objective facts, 
Kuhn states: 

Is sensory experience fixed and neutral? Are theories simply man-made interpretations of 
given data? The epistemological viewpoint that has most often guided Western philoso- 
phy for three centuries dictates an immediate and unequivocal, Yes! In the absence of a 
developed alternative, I find it impossible to relinquish entirely that viewpoint. Yet it no 
longer functions effectively, and the attempts to make it do so through the introduction 
of a neutral language of observations now seems hopeless. (p. 126) 

What becomes evident in the situational paradigm is that there exist multiple 
ways of knowing. Reality is socially constructed and not “out there” waiting to be 
discovered; it is dependent on the biographical features of an individual and his or 
her interpretations of events. 


Educational Research 

Within the situational-interpretive orientation, the major focus of research is to 
provide meaningful descriptions and interpretations of events and phenomena. 
Rather than developing a priori theoretical statements for which validation is 
sought, concepts and interpretive frameworks are only suggested by the researcher 
and are tentative and subject to change as more powerful concepts or themes 
emerge. It becomes a major effort of the researcher to make explicit the expecta- 
tions, attitudes, motives, and common meanings that teachers and students have in 
specific situations. The researcher “cannot stand aloof as an observer as is done in 
empirical-analytic research, but must enter into intersubjective dialogue with the 
people in the research situation” (Aoki, 1978, pp. 60-61). 


An example of situational-interpretive research is that conducted by Drysdale 
(1983). His research focused on the kind of meanings students and teachers of high 
school biology give to their experiences of working in the field during winter 
months. He used a variety of ethnographic strategies including direct observation, 
interviewing, transcript analysis, and direct participation to arrive at his interpre- 
tations. A conceptual framework using area of being (passive, immediate, respon- 
sible, transcendent) following Rothe (1979) was incorporated to help analyze his 
data and arrive at additional thematic structures. His research lends support to the 
intuitive, but often unsubstantiated, feelings of worth that most science educators 
have regarding the importance of laboratory and field work. 


Another interesting study was carried out by Oldham (1982). Her work at- 
tempted to uncover the meaning of difficulty as experienced by students within the 
context of high school biology. She states, “In contrast to the predominant curricu- 
lum orientation which views difficulty as a phenomenon which can be objectively 
measured and should be eliminated, this study perceives difficulty as a subjective 
and meaningful experience” (p. iv). From analysis of observations, interviews, and 
transcripts she found it possible to determine three levels of meaning for difficulty: 
the first involved participants’ perceptions of difficulty in relation to instructor-directed 
activities; the second involved closer scrutiny of the phrases used by participants 
to describe “difficult experiences”; and the third treated difficulty as a mode of 
“being-in-the-world,” a possibility that is ever present. 

Other examples of interpretive studies are those of Thompson (1984), Boag and 
Massey (1981), and Pearse (in press). Thompson investigated the professed beliefs 
and views about mathematics and mathematics teaching of three teachers and tried 
to determine how their conceptions influenced their conventions and behavior. 
Boag and Massey conducted a situational study to try to understand two elementary 
teachers’ responses to imposed changes in a social studies curriculum. Pearse 
examined the notion of what it means to be a student teacher in a teacher educa- 
tion program. 

In each of the above studies there exists the notion of trying to uncover the un- 
said or the unmentioned to try to arrive at deeper understandings of the meanings 
participants place on events and happenings which shape our existence. 


Critically Reflective Inquiry Orientation 

Person/World Relationships 

In this orientation, humans are viewed as integral components of their world both 
acting upon and being influenced by their environment, that is, “in and with the 
world” (Freire, 1973, p. 4). However, this orientation has a broader base than either 
the empirical-analytic orientation or the situational-interpretive orientation. The 
former is interested primarily in second-order descriptions of social phenomena 
while the latter is interested in first-order accounts typifying the meanings actors 
give to situations in terms of their daily on-going life. The critical orientation is in- 
terested in “questioning these descriptive accounts, whether they be second-order 
or first-order, and in probing for the underlying bases in order to reveal tacitly held 
intentions and assumptions” (Aoki, 1978, p. 61). This questioning, often referred to 
as critical reflection, has with it the notion that humans have the potential to 
change their situations, and the realization of this potential is dependent upon the 
awakening of their awareness to their situations in order that they may broaden 
their choices for action. If humans fail to develop their critical capacity for choice 
of action, they are no longer integrated, but adjusted or adapted to their 
surroundings, and this condition is symptomatic of their dehumanization (Freire, 
1973). Criticism, as a way of knowing that helps free an individual from depend- 
ence on taken-for-granted assumptions, becomes a major focus of this orientation. 

Self-understanding, however, is more than mere introspection where the in- 
quirer may fall into the ontological “trap of solipsism” (Wagner, 1983, p. 39). 
Instead, it calls for the inquirer to articulate the links between self-knowledge and 
the actions of everyday living. Recognition of discrepancies between interests and 
the existing quality of life must be followed by modification in attitudes and 
actions in everyday life. This reciprocity between reflective thought and action is 
often referred to as praxis (Groome, 1980). 

The critical perspective calls for an examination of the metaphors employed by 
participants in any field, as these metaphors can indicate to a large extent the 
thought processes which determine the kinds of questions asked. In high school bi- 
ology programs examples of central metaphors are those of repression and regula- 
tion, feedback and control. It should not be taken that questions which originate 
from such metaphors should not be asked, but that the framework and assump- 
tions which give rise to such questions should be identified. 

Educational Research 

Researchers in education have largely taken for granted educators’ problems and 
accepted the educators’ definitions of the problems along with their assumptions 
and interpretations. If, however, we are to regard education as a social activity 
where the participants are actively creating reality, we must, as Young (1974) 
suggests, “bracket out or suspend our belief in the ‘givenness’ of 
that-which-scientists-claim-to-know, and indeed that which educators claim to know” (p. 58). 
What we must be attempting to do is ask, and attempt to answer, questions such as 
the following: 

1. What knowledge is of most worth? 

2. Who will decide what knowledge is acceptable? 

3. Why do we educate students in various disciplines (i.e., whose interests are 
being served)? 

The critical researcher inquiring into such questions seeks to uncover the as- 
sumptions involved in answering these questions rather than examining the mode 
of presentation in school. Concerns relating to epistemology, power relationships, 
root metaphors, politics of schooling, and other factors that interplay to construct 
that phenomenon we know as education need to be examined and exposed so that 
participants have a clearer view of their situation. Only then can responsible action 
take place which challenges the traditional notion of school disciplines as being 
fixed and asocial, whereas in reality they are socially constructed. 


A number of studies have recently been conducted which attempt to critically 
examine different aspects of schooling. Rowell (1983) examined the interactions 
which constituted the planning for implementation of the International Baccalau- 
reate Program in an urban school district. Her study involved becoming a partici- 
pant observer in the biology teachers’ planning group, interviewing school trustees 
and administrators, attending workshops and information meetings, and textual 
analysis. Her framework for analysis consisted of viewing program implementation 
“as installation, as interpretation, and as reflective action” (p. v). She found a prev- 
alence of technical concerns over situational or reflective ones as the predominant 
perspective of implementation. Deeper analysis of the data led her to state that 
“features of the discourse (for planning) are indicative of ideological practice in 
which the activities and interests of the practitioners are aligned with the con- 
cealed interests of dominant social groups upholding the educational institution” 
(p. v). 

In another study, Benson (1984) examined the interpretations of science that 
were held by grade 12 biology students. Benson participated as an observer in a 
number of classrooms, interviewed students and teachers, and examined curricular 
materials to arrive at an understanding of what were the students’ interpretations 
of science and why they held the interpretations they did. The data indicate that 
school science influences greatly the notion some students have of science (al- 
though some of the more astute students were quick to point out the difference be- 
tween “school science” and “real science”) and that teachers and curriculum 
materials are often presenting a modified Baconian notion of science. As well, 
teachers frequently use “objective” knowledge as a means of power and control in 
the classroom. 


Hendry (1983) reviewed person/world relationships and their influence on de- 
termining environmental values. Her analysis looked at the historical roots of value 
systems in environmental education and tried to trace the linkage of the taken-for-granted 
assumptions that we hold today regarding environmental education. These 
assumptions need to be examined if we are to avoid the wanton destruction of the 
environment, as currently seems to be the case. Hendry notes, “It is through the 
schools, teachers and students, that recognition, awareness, and understanding of 
environmental values in our man/world relationships will be achieved” (p. 38). 


Complementarity 

The question now needs to be raised as to whether the various perspectives discus- 
sed are mutually exclusive, or if there is some common ground upon which re- 
searchers can come together. A number of authors have indicated that there are 
fundamental epistemological differences between world views and we are not simply 
looking at differing methodologies. Smith (1983), for example, suggests that the 
realists’ (empirical-analytic) perspective is one which views the world as a subject- 
object relationship where there is an independent reality, where physical and social 
scientists have a similar relationship, and where there is a separation of fact and 
value. Idealists (situational-interpretive and critical), on the other hand, view the 
world in a subject-subject relationship which rejects the dualism of mind and real- 
ity and which intermingles facts and values. How one carries out the process of in- 
vestigation has serious epistemological considerations. Further, he suggests that 
rather than “accept the unfounded assumption that the methods are complemen- 
tary, we must insure that the problem is the subject of serious and extended de- 
bate, not only among philosophers, but even more important, among practicing 
educational researchers” (p. 13). 

In a follow-up paper, Smith and Heshusius (1986) continue to emphasize the 
notion that there is a difference between perspectives (i.e., it is not simply a differ- 
ence of “methodologies”). They call for a continuation of the debate and state that 
the attempt to avoid conversation is wrong. Many of the claims for compatibility, 
they suggest, are often related as how-to-do-it (an attempt to quantify qualitative 
methodologies) and that research is a matter of “what works.” Many educational 
researchers, they claim, have arrived at “the unfortunate conclusion that there are 
no longer any significant differences between the two perspectives” (p. 10). 

A supportive train of thought was put forth by Tuthill and Ashton (1983) who 
called for an organization of educational communities around educational para- 
digms, ones that practitioners and researchers can share. In their view, differing 
paradigms result in unsharable understandings and knowledge. Researchers and 
practitioners must work together to solve similar paradigmatic problems. 

Howe (1985), however, is of the belief that paradigms which call for either qual- 
itative or quantitative research strategies are fundamentally the same and in the 
final analysis are based on interpretation. To him, the value-free/value-laden dichotomy 
of the positivists is false and combining approaches is on solid epistemological 
ground. 

Smith (1983) as well as Miles and Huberman (1984) would support Howe’s 
ideas calling for combining various approaches. They would acknowledge that there 
are differences between approaches (perspectives) but that we can get a better no- 
tion of what we are investigating if we use a variety of methods. 

Olson (1986) also leans in this direction. He indicates that he sees a shifting of 
knowledge and that hermeneutic (interpretive) science and natural science have 
come closer together. What is different between them is the objects of study. He 
claims that their epistemology is similar, as is their methodology, and that logical 
positivism (in the traditional sense) is fading. To Olson, the distinction between explanation 
(natural science) and understanding (interpretive science) is relative and 
they are probably much “more closely related than has previously been noted” 
(p. 168). 

Holmberg (1982) accepts the notion of empirical and interpretive research and 
suggests that for purposes of explanation (empirical) and understanding (herme- 
neutic) the two are complementary in that they are both rational activities. How- 
ever, he would not agree with a critical stance such as that proposed by Habermas 
(1971) in that he sees a critical perspective as being too subjective and presenting 
biased beliefs from a nonneutral starting position. Value judgments are not open to 
investigation and are therefore irrational and unacceptable as scholarship. He does 
not, however, help the reader to understand how his claims for rational research 
(i.e., empirical and interpretive) are indeed value free. 


Perhaps the positions taken by Soltis (1984) and Mosenthal (1985) most closely 
agree with that taken by the authors of this article. Soltis calls for an associated 
community of researchers where not all researchers do all kinds of research, but do 
what they do well. Pedagogy, for Soltis, is a basic human activity, extremely com- 
plex in nature, which requires a combination of research. It follows then, that “re- 
search in education broadly conceived must be empirical, interpretive, normative, 
and critical” (p. 9). 

Mosenthal (1985), writing on the progress of educational research, identifies 
three approaches for carrying out research: one, a literal-normative perspective 
which has an ideal (i.e., definition) in mind and typically invokes a discrepancy 
model based on correspondence theory; two, an interpretive perspective in which 
no ideal exists and definitions represent constructed interpretations of reality; and 
three, an evaluative perspective in which there are evaluative definitions of pro- 
gress which have ideological implications of the definition of the phenomenon for 
the larger society. Each of the definitions of educational progress makes different 
claims as to how educational research contributes to practice, conceptualizes the 
theory-practice relationships differently, and focuses differently on the question of 
how educational research might improve practice. He calls for complementary ap- 
proaches to what he sees as one of the major problems of educational research, that 
of partial specification, and states that for educational progress to take place, pro- 
gress must be made using all three perspectives—literal, interpretive, and critical. 


Concluding Comment 

It was not our purpose to minimize the importance of any one particular 
orientation, nor to suggest that there exists, or should exist, a hierarchy regarding 
the importance of differing paradigms or orientations. As with any categorization 
or classification system, including the analytic framework developed by Habermas, 
we are reducing the phenomenon (i.e., educational research) to less than what it is 
for the purpose of discussion and clarification. In so doing, we have a tendency to 
view differing orientations in juxtaposition, often perceiving them to be in competi- 
tion with one another bringing about a tension among scholars in the field, often 
causing us to build walls between various camps. It is possible, however, to view 
tension in a positive way as a force which will bring about deeper dialogue among 
those of us engaged in educational research and help us to form bridges, not walls. 


It is hoped that what was presented will help to bring about an awareness of 
complementarity; that what is needed in education is a multiplicity of views and 
orientations along with differing approaches to research. Using a particular framework 
(paradigm) to guide our thinking not only leads us to certain kinds of 
outcomes, but also limits the kinds of questions we can ask. If we wish to come to a 
fuller understanding of the phenomenon known as education, we had best be pre- 
pared not only to be tolerant of, but also open to, alternative perspectives, 
struggling toward the possibility of moving beyond the mere alternatives to a fuller 
understanding of what education indeed is. 


BOOK REVIEWS 


AN INTRODUCTION TO EDUCATIONAL ADMINISTRATION IN CANADA. 
By Leslie R. Gue. Toronto, ON: McGraw-Hill Ryerson, 1985, 274 pp. 


It is important to remember that this book is written for students at the under- 
graduate level—people who, for the first time, will be thinking about educational 
administration. In terms of the intended audience the book is introductory, but 
topics in the first chapter, such as a philosophy of administration, administration 
as a process in context, and phenomenology are issues which seasoned administra- 
tors are currently debating. 


The author’s stated purposes of touching upon the core of educational administration 
as a foundation for further study and of presenting administration in a positive 
light are laudable. Topical issues of human rights and the women’s 
movement, along with the challenge of making the administration of schools more 
human and more responsive, are introduced early in the first chapter. Few would 
argue with the central philosophy of the book that administration is always a facili- 
tating process with the emphasis on context. Chapters dealing with administration 
as a cultural universal, bureaucracies, the organization of education in Canada, or- 
ganizing schools for effective instruction, teachers and administrators as profession- 
als, and the harmonious school (discipline) are not out of the ordinary. The second 
last chapter on global, national, provincial and system issues with guidelines for 
coping with issues, and the last chapter about administration as a positive force, 
make a more distinctive contribution to the literature. 


The sequence of chapters and the selection of content have caused this reviewer 
some concern. The chapter on professionalism, for example, is between the chap- 
ters on organizing schools and the harmonious school. In the first chapter adminis- 
trative processes are identified as decision-making, organizing, implementing, 
planning, coordinating, communicating, and evaluating, yet in the second chapter, 
it is stated that supervision is part of the process of administration and that 
teachers are both supervisors and supervisees. Playground supervision and the 
visits of others in the classroom to supervise are a prominent part of a beginning 
teacher’s life at work and the author does not give the topic the attention it de- 
serves. In the opening chapter “Phenomenology” is as prominent a heading as “Ad- 
ministration: A Process in Context” and has more space than the topic “Teachers as 
Administrators.” A second concern regarding selection of content is the highlighting 
of bureaucracies in the second chapter at the expense of considering schools at the 
board level as political systems and at the classroom level as social systems. 


A major shortcoming of the book is a lack of consistency. In the foreword to the 
book it is observed that education is in the political domain. Chapter 6 stresses that
everyone wants limits, yet in the section on humanizing complex organizations it is 
stated that, “if the group is free of control or influence from external forces, it is 
more likely to be successful and productive.” One wonders about the justification
for the quoted statement; to suggest that schools, or even classrooms, could, or
should, be free of external influences is consistent neither with reality nor with other
points emphasized elsewhere in the book. A further illustration of lack of consistency
is the stated importance of making organizations more human, of fostering
creativity, and of seeing administration as a joy, yet the areas selected for emphasis 
in the appendices are timetabling and a list of 65 questions for consideration when 
starting in a new school—hardly a creative approach! Why is the model for illus- 
trating the administrative process in context, presented in the first chapter, not 
utilized as a means of integrating ideas and approaches throughout the book? 
Rather than the list of questions, consideration of the culture of the school-commu- 
nity as one antecedent from the model would be a reasonable, professional, and 
challenging approach for a new teacher to follow. 

Surprisingly, situational leadership and the literature on effective schools are 
neglected in the book. School renewal is given one-half page. Contemporary dilem- 
mas such as centralization-decentralization (control-autonomy) are not made ex- 
plicit and examined in some depth. More alarming in a book on educational 
administration is the advocacy of a human relations model of leadership. In the 
second chapter it is said that “good administrators in this generation use consulta- 
tive rather than directive administrative styles.” The fact that strong leadership is 
characteristic of effective schools, the increasing evidence that teachers’ beliefs 
often change after behavior does, group maturity, and the pleas for tighter coupling 
of educational organizations are not taken into account. A further comment in the 
book that good administrators make a conscious attempt to relate warmly to all 
levels of the hierarchy leads this reviewer to conclude that the human relations 
level of administration is considered to be the highest level. Most categorizations of 
leadership have at least an educational and a problem-solving, or a symbolic and a 
cultural, level beyond human relations. What is even more bewildering is that the 
author, after advocating a consultative style in early chapters, reports in the fourth 
chapter that there are few conclusive findings regarding participative decision- 
making. 

Undergraduate students would likely appreciate the emphasis on human beings 
in this book and the attention given to culture. They would be impressed with 
topics pertaining to the Keegstra affair, multiculturalism, Third Wave Schools, and 
current issues such as a curriculum for the future, education of Natives, main- 
streaming, and computer-related instruction. They would be perplexed, however, 
by the conclusion in the opening chapter that both phenomenology and social 
systems theory have a place, and by a paragraph dealing with quantitative and 
qualitative methods in research. They would be disappointed at not finding useful 
guidelines for entering into contracts and at a lack of attention to legal considera- 
tions such as negligence. To beginning teachers such topics are a higher priority 
than details about federal grants or a long discourse on the status of teaching as a 
profession. 


In summary, this book is a useful addition to the literature on educational ad- 
ministration. This reviewer would recommend it as a reference book rather than as 
a text for undergraduate students. 

Earle E. Newton 
Associate Dean of Education 
University of Saskatchewan 

MEMORY AND INSTRUCTION. By David A. Baine. Englewood Cliffs, NJ: Educational
Technology Publications, 1986, 338 pp.


This book is an attempt to bridge the gap between theory and practice in the areas 
of psychology and education. Explanation of teaching methodologies in relationship 
to cognitive psychological research findings is long overdue, and is a sign of the in- 
creased awareness and education of the regular and specialized classroom teacher. 
Teachers have used a variety of instructional techniques without really knowing 
why one method is effective for one student while it does not work for another. 
Historically, psychologists have investigated aspects of memory function without 
any transference to the classroom setting. Similarly, educators have devised in- 
structional techniques which are without a theoretical base, and it may well be the 
role of the educational psychologist to interpret theory into practical application. 
Dr. Baine devotes chapter 1 to “Memory Models and Research” and chapters 2
through 12 to instructional application which includes “Mnemonic Strategies,” 
“Cognitive Behavior Modification,” “Prose Recall,” “Instructional Models and 
Methods,” “Instructional Sequences,” and “Pictures and Mnemonics.” 


Four models of memory are cursorily described in chapter 1. The Multistore
Model of Waugh and Norman (1965) and Atkinson and Shiffrin (1968) is described
and critiqued with the introduction of the short-term memory/long-term memory 
debate. The Levels of Processing Model of Craik and Lockhart (1972) is described 
and the limitations of the model noted. According to Dr. Baine, the “currently pop- 
ular models of memory are schemas and scripts,” which are briefly defined and 
rather less briefly commented upon. The remainder of the first chapter is a review 
of procedures typically used in the research of memory functions. Dr. Baine de- 
scribes this research as basic memory research which is usually conducted in labo- 
ratory settings and has little ecological validity. He gives the reader a strong 
warning not to generalize laboratory findings in memory research to the natural en- 
vironment because most experimental studies are conducted on college-aged stu- 
dents rather than on children and adolescents. 


The three chapters entitled “Mnemonic Strategies” give a comprehensive and 
cohesive explanation of memory research and its applicability and limitations for 
instructional techniques. A variety of mnemonic strategies are described which in- 
clude chunking and categorizing, labeling, visual imagery, and syntactic and seman- 
tic encoding. Each strategy is supported by research studies which have been 
conducted on children of varying ages and intellectual abilities. A summary chart 
showing the studies reported by age level of subjects and methodology would have 
been a useful inclusion. The developmental nature of memory and use of mne- 
monic strategies could have been more clearly defined. However, this reviewer is of 
the opinion that the three chapters on mnemonic strategies successfully bridge the
gap between theory and application in memory research. 

In the two chapters dealing with prose recall and comprehension, the author re- 
views research of narrative and expository prose. The limitations of the methodolo- 
gies used in this type of research are well outlined. A review of measurement 
instruments, type of prose, and age of students sampled in the studies indicates 
little generalizability and comparability between or among studies in this area. 
There is a detailed description of narrative and expository prose which should 
prove useful for both educators and researchers. The role of narrative and exposi- 
tory prose in the developmental reading process is explained, together with some 
practical examples of instruction. The confusion and contradiction in the area of
reading research is evident in the description of studies investigating the effect of 
script size, passage length, and pictures on comprehension. Equivocal results will
obviously prevail without agreement on a definition of reading comprehension and 
some replication of measurement instruments and methodologies. The review of 
studies investigating the effectiveness of underlining and notetaking is informative 
for both student and teacher. The chapter includes some practical suggestions for 
aids to recall for learner and teacher. 

The five chapters which cover mnemonics and comprehension provide cohesive 
descriptions of the application of researched techniques. Unfortunately, the chap- 
ters entitled “Metamemory,” “Cognitive Behavior Modification,” and “General- 
izations and Maintenance” are weak in review of research and confusing in the dis- 
cussions of instructional techniques. All these chapters are polemic yet supply no 
summary, conclusions, or final synopsis. 


Chapter 12, “Pictures and Mnemonics,” seems misplaced within the organiza- 
tion of the whole book. Instead of being the final chapter it would appear to be 
more appropriately and logically placed following the early chapters describing 
mnemonic strategies. “Instructional Models and Methods” impresses as a final 
chapter because it contains a summary of most of the instructional procedures pre- 
viously described in the book. This chapter is well organized and clearly explains 
instructional techniques such as prompting and fading, chaining, structural instruc- 
tion, and modeling. The following chapter, “Instructional Sequences,” provides 
specific illustrations of procedures mentioned in the previous chapter. Lesson plans 
outlining instructional directions using readiness signals, recall, review reinforce- 
ment, and correction are illustrated using the application of morphographic and 
phonetic spelling rules. This chapter contains valuable information for the class- 
room which may have been better placed as an appendix to the main material. 


There is no conclusion to “Memory and Instruction.” The book ends, as does 
every chapter, with the description of another study without any attempt to 
synthesize or summarize. This creates an impression on the reader of having read a 
review of a long list of studies. Another irritating element of the book is the seem- 
ingly indiscriminate use of italics. One assumes that words and phrases the author 
considers of importance to the reader are italicized. In fact, the use of italics is
inconsistent, which distracts the reader from comprehension of the text rather than
enhancing comprehension. For example, the use of italics to introduce or emphasize 
terminology such as “free-recall” or “paired associate” is comprehensible, but the 
italicizing of the words “difficulty” or “example” within a sentence breaks the
reader’s concentration and interrupts the text unnecessarily. A glossary of terms 
would have been a welcome addition to the book and may have helped minimize 
the terms which required italics. 


In his book Memory and Instruction, David Baine has provided the reader with 
some excellent suggestions for instructional techniques. This reviewer is disappointed,
however, that the title’s promise to give equal time to memory is not kept. The first chapter
appears to represent the author’s attempt to explain a century of research in the 
area of memory. It is inadequate and could even be misleading to educators who 
may feel it is of no consequence to investigate theoretical research after reading 
that chapter. The two chapters on mnemonics represent one of the most cohesive 
texts written on the subject of memory and instruction and make the book well
worth reading for educators and psychologists. 

As a text for describing instructional techniques I think the book will be useful
for undergraduate methodology courses, but as a text on memory and instruction it 
is incomplete. The book falls short of outlining a theoretical base from which in- 
structional techniques are formed, and consequently does not fulfill the expecta- 
tions which the title creates. The attempt to meld theory and practice in the field 
of educational psychology is worthwhile and needs to be pursued. It is regrettable 
that the attempt is not successful in Memory and Instruction. 

Janice A. Booth 
Psychology Department 
British Columbia Children’s Hospital 


CANADA’S COMMUNITY COLLEGES: A CRITICAL ANALYSIS. By John E. 
Dennison and Paul Gallagher. Vancouver, BC: University of British Columbia 
Press, 1986, 360 pp. 


Should every student of higher education in Canada have a copy of this book on his 
or her shelf? The answer is Yes, but with some reservations. This book would ap- 
pear to be particularly useful for students being introduced to the study of the or- 
ganization of postsecondary education. To these readers, the book could be 
valuable as a survey introduction and serve as an excellent resource document. In 
particular, the comprehensive bibliography, which includes separate listings of 
French language sources as well as English language theses, is a valuable compila- 
tion. One also might imagine that the book could be useful to members of college 
organizations, particularly as a document which could orient one to the diversity of 
Canadian community college settings and institutions. 


Canada’s Community Colleges, however, is not without shortcomings. It takes a 
special kind of courage for authors to engage in the daunting task of attempting to 
record the diversity of Canada’s community college scene in a single publication. In 
many ways, Dennison and Gallagher have produced a document which represents a 
valuable contribution in terms of survey information, but the complexity of the 
task they undertake is reflected in the structural weaknesses inherent in their final 
product. 

The book is a perplexing amalgam of commentary and description. Historical, 
political, operational, and/or financial perspectives of the institutions are often pre- 
sented without any explicit sense of a systematic thematic development. Occasion- 
ally a comparative approach is taken; at other times the authors use isolated 
illustrative examples. The result is a treatment which is uneven and often 
superficial. In desperation, the reader looks for a consistent framework of analysis, 
a coherent guiding vision, or the systematic development of the authors’ viewpoint. 
In the absence of these structural elements, the book tends to become a bland and 
disjointed description, concluding in a rather unconvincing summary statement. 
The reader of what promised to be a critical analysis is left with a problem not un- 
like that faced by a diner partaking of a Chinese dim sum breakfast. One tastes a 
multitude of dishes but there is little sense of pattern. In the end the diner has 
little choice but to attempt to bring his or her own order to a sometimes confusing 
array of tidbits wheeling past. It is not my intent to challenge the veracity of the 
material included in the book, but after reading one is left with a deep suspicion 
that the services of a strong editor might have left us with a document which could
have provided greater insight, understanding, and a more focused critique. 

Turning to the substance of the book, in introducing their intentions the au- 
thors appear to have cast a rather broad net. The main features of their agenda are 
apparently as follows: To provide the story of how and why colleges were established
in Canada; to provide an assessment of what colleges have become; to provide
an appraisal of the tests that colleges will face during the next 15 years; and
finally, to provide a reappraisal of the role and mission of Canada’s colleges. A tall 
order! In addition, Dennison and Gallagher seem to be intent upon making a case 
for—if not a national educational system—at least the development of what they
refer to as a “national consciousness.” The roots of this desire and the precise na- 
ture of a “pan-Canadian consciousness” are not made explicit. But, interestingly, 
the whole notion seems to become less and less credible as the book develops. I 
will, however, return to this issue at the conclusion of this review. 


Chapter 1, which extends to five pages, is purported to be a discussion of the 
various influences which have affected Canadian community college development. 
Notwithstanding the fact that the authors assure us that reviewing this develop- 
mental process will be an “adventure into the Canadian consciousness,” chapter 1 
barely provides the content sufficient to warm to the subject. 


The “adventure,” however, is resumed in chapter 2. The reader is plunged into a 
series of province-by-province historical vignettes describing the development of 
college-like institutions. Interestingly, as chapter 2 develops, each historical 
vignette abruptly ends before there is a description of recent college operations. We 
discover later that the descriptions are taken up once again in chapter 4, but this 
represents a rather odd and unnecessarily disconcerting arrangement of the mate- 
rial. 


As the institutional developments are revealed, the structural, operational, and 
philosophical diversity across provincial settings is really quite breathtaking. At the 
same time, this diversity tends to leave the reader very confused. The authors pro- 
vide no introductory description of the characteristics of a community college, nor, 
as an alternative approach to structuring the material, do they explicitly organize 
the description around characteristics common to all the institutions they are deal- 
ing with. The reader is soon lost in myriad details. One finds oneself asking: How is 
it possible that this apparent institutional diversity can be classified as a single 
generic type—a community college? 

Curiously, chapter 3 seems to have been inserted to serve the purpose of re- 
sponding to this concern, but this represents an odd placement of synthesis mate- 
rial. In chapter 3 the authors intercede with what appears to be a textbook list of 
community college characteristics. They make a weak argument that this particular 
listing is indeed grounded in the Canadian experience and emerges from their de- 
scriptions in the preceding chapter. Never mind that the typology of characteristics 
clearly does not provide a satisfactory overlay for the diversity of institutional 
structures and operations which have been presented. They choose to ignore this 
fact; hence, their assertion remains unconvincing. 

The second half of the book is taken up with what appears to be a collection of 
discrete essays relating to issues which the authors have identified as important to 
Canadian community colleges. The essays cover such matters as the future, institu- 
tional identity, institutional politics, and institutional excellence. Interestingly, 
Dennison and Gallagher again make very little effort to account for the institutional
diversity their vignettes revealed in the first segment of the book. Time and
again, the issues and organizational elements are discussed as if the college-like in- 
stitutions are very similar and/or emerge from very similar sociopolitical settings. 


It is, of course, clear that such is not the case, and Dennison and Gallagher 
often acknowledge that there is diversity, yet the discussions are frequently pre- 
sented as if the institutions are all cut from very similar cloth. Where differences 
are acknowledged and highlighted, the authors’ agenda seems to be to call atten- 
tion to selected ideas or approaches in order to facilitate institutions in fashioning 
more homogeneous ways of dealing with their local settings. There seems to be a 
desire on the part of the authors to see a nationally coordinated system of college 
education. This orientation returns us to the earlier mentioned notion of a 
pan-Canadian consciousness. 


As suggested, the notion of pan-Canadian consciousness is very weakly drawn 
by the authors. The idea, however, is apparently driven by a desire for an “improved”
national sense of citizenship. The community colleges, through their “common
college culture,” are seen by Dennison and Gallagher as the institutional
vehicles through which this consciousness might be developed.


Aside from the fact that their book does little to convince us that there is a na- 
tional college culture, the idea of the search for and/or the development of an inte- 
grated, pan-Canadian consciousness (the very notion makes one draw back in 
horror) to my mind represents a wrong-headed, technocratic, utopian ideal. It ap- 
pears to be an idea built upon the notion that the social and political diversity 
which is Canada ought to be viewed as somehow being broken and in need of fix- 
ing. Cast in different clothing, this idea seems to be the tired old quest for a 
unifying and largely mythological Canadian identity. 

What the authors seem to have chosen to ignore, while invoking their idea of a 
pan-Canadian consciousness, is the Canadian paradox: Canadian shared conscious- 
ness is the very diversity and the differences we observe in our society. This shared 
diversity is reflected in the institutions we create for ourselves. To the extent that 
diversity exists in our country, it is a national strength. The lack of a pan-Canadian 
consciousness ought to be seen as a celebration of this mosaic. To the degree that 
the community colleges in Canada reflect and enrich a locale—with due regard for 
the larger social context—all Canadians are well served. 


Even though Dennison and Gallagher apparently would like us to be more like 
one another, there obviously is no one best Canadian college model. Dennison and 
Gallagher have not presented convincing arguments that there can or should be
clearly defined, national standards of institutional excellence. In fact, the disso- 
nance caused by educators continually running up against a wide variety of institu- 
tional experiments in Canadian nonuniversity settings is, to my mind, a very 
healthy situation. As Canadians, we move to homogenize our college educational 
experiment at our peril. The first sentence of this book states: “Educational institu- 
tions reflect as well as shape their societies.” I implore the authors to look again! 


Don Chapman 
Program of Adult and 
Higher Education 
University of Alberta 

PREPARATION OF MANUSCRIPTS


1. All manuscripts must be typewritten, double spaced, and submitted in 
triplicate and are not to exceed 25 pages in length. 

2. An abstract of approximately 100 words in length, typed on a separate page, is 
to be included. 

3. The author’s name and affiliation should appear only on a separate title page 
to ensure anonymity in the reviewing process; the title of the paper should ap- 
pear on the first page of the manuscript as a means of identification. 

4. Tables must be numbered in Arabic numerals with the word “Table” aligned
center, e.g., Table 4. The heading of the table is to appear below. 

5. Graphs and charts should be used only if essential. They must be carefully pre- 
pared ready for reproduction. Graphs must be properly labelled using Arabic 
numerals, e.g., Figure 3. 

6. Each table or figure should be presented on a separate page. The position of 
tables and graphs should be clearly indicated within the text by inserting at 
the relevant point the phrase (Insert Table 2 here). 

7. Sources should appear in parentheses following the reference, citing the 
author’s name (unless the name appears in the text), the year of publication, 
and page number if appropriate. For direct quotations, the reference should be 
cited and the page number given in brackets before the final punctuation of 
the quotation. The references should be listed alphabetically by authors’ last 
names at the end of the manuscript under the heading, References. 

8. Explanatory notes, numbered consecutively and identified in the text with a 
superscript, may be included under the heading of Notes. They should be 
double spaced and placed at the end of the manuscript immediately preceding 
the References. The citing of references and quotations in the Notes should 
conform to the procedures outlined in No. 7 above. 

9. Spelling shall conform to Webster’s New Collegiate Dictionary, except in the 
case of direct quotations which must conform to the original. Editorial altera- 
tions will be made if necessary. 

10. In matters of style, the third edition of the APA Publication Manual is consid- 
ered definitive. 

11. Submission implies that the manuscript is not being considered for publication 
elsewhere. 


The views expressed and the accuracy of the statements made are the responsibil- 
ity of the individual authors. The editor is solely responsible for the editorial 
comments. 


AJER is indexed in the Alberta Education Index, Canadian Education Index, 
Current Contents/Social and Behavioral Sciences, ERIC/Current Index to Jour- 
nals in Education, and Social Science Citation Index; appropriate articles are ab- 
stracted in Cognitive Development Abstracts, Educational Administration 
Abstracts, Educational Technology Abstracts, Family Abstracts, Language Behav- 
ior Abstracts, Multicultural Education Abstracts, Psychological Abstracts, 
Research into Higher Education Abstracts, School Organization and Management 
Abstracts, and Sociology of Education Abstracts, and reviewed by the Journal for 
Research in Mathematics Education. 

Manuscript Assessment


An important part of the research process is the sharing of results with the scholarly 
community. One way of doing this is by writing an article for a research journal. Most 
journals that carry reports of empirical studies tend to use similar evaluative criteria 
in the assessment of manuscripts. These criteria may be stated in the form of nine
questions:

1. Are the conceptualization and rationale of the study apparent? 

2. Has previous pertinent literature been cited? 

3. Has the continuity between the study and previous work been demon- 
strated? 

4. Are data sources and collection/analysis procedures described and ap- 
propriate? 

5. Has the trustworthiness of the study been established? 

6. Are the conclusions substantiated by the evidence presented? 

7. Have the similarities and differences between the results of the study 
and the work of others been clarified? 

8. Have the theoretical, practical, and/or methodological implications been 
identified? 

9. Has the contribution or significance of the study been assessed? 


If you are planning to write an article, or are in the process of writing one, keep 
these questions in mind. 
If you have just completed a manuscript and can answer yes to each of these ques- 
tions, then send it to us. Chances are we will publish it. 

Computer Technology and Education


Realizing the educational potential of computer technology largely depends on devel- 
oping appropriate policies related to the technology. A policy Delphi methodology 
was utilized to identify those changes in education which were probable and possible 
through the continued introduction of computers as well as to identify potential pat- 
terns for arriving at a normative (desired) future. Participants in the study panel con- 
sisted of 18 individuals with a knowledge of computers and a background in 
education. Data collection included a pilot study followed by three distinct iterations 
or rounds. The conclusions emphasize the need for ongoing activity related to policy 
making especially in areas related to determining educational priorities and strat- 
egies for dealing with technological change. 


It is generally agreed that computers are the dominant technological artifact of our 
time. Lepper (1985) reflects the view of numerous writers: “We are currently on the 
edge of a revolution in technology that may eventually prove more sweeping and sig- 
nificant than any other technological advance in the last 200 years—the revolution 
that is likely to occur as powerful microcomputers begin to infiltrate our lives” (p. 1). 
Masuda (1980) describes computer technology as fueling a transformation from an in- 
dustrial society to an information society. The widespread acquisition of computers 
by schools is often cited as reflecting this societal transformation. Numerous reports 
attest to the rapid growth in numbers of computers in the schools of most of the prov- 
inces and states (Bork, 1984; Morgan, 1984; Petruk, 1985; Weiner, 1984). Computers 
have generated a great deal of enthusiasm and excitement as schools continue to acquire
them at a rapid rate. Shavelson and Salomon (1985) identify a major reason for
these developments: “For the first time, a genuinely ‘thinking’ interacting technology, 
the computer, has become readily available to education. ... The pedagogical promise 
of the new information technology is boundless” (p. 4). 

Despite the large-scale acquisition of computers in schools there is increasing 
concern that they are having a limited impact on the instructional process (Becker, 
1984; Bork, 1984; Ebisch, 1984; Papert, 1979; Shavelson et al., 1984; Sheingold, Kane, 
& Endreweit, 1983; Weizenbaum, 1979). Two contrasting visions of the potential of 
computers in education are presented by Kohl (Ebisch, 1984): 


Dr. Steier is a visiting assistant professor with the Department of Organizational Analysis, Faculty of Business, and also lectures 
in the Department of Educational Administration. 
Computers are special, they’re not just an electronic version of the same old boring school
tools. But they could be used to reinforce the worst practices of schools. ... The other possibility
is that the computer becomes a tool to extend the capabilities of one’s mind, a device
for modeling possible worlds, for doing scientific simulations, for allowing kids at a very
young age to begin to see that you can control variables and understand systems and do
these intellectual things visually. (p. 38)

Kohl’s more promising pedagogical vision of computer technology will not be 
achieved easily. Shavelson and Salomon (1985) suggest that: “The impact of the new 
technology on cognition is not guaranteed. Its impact depends largely on how stu- 
dents and teachers use the new technology” (p. 4). For Lepper (1985), the time for re- 
search is now: “Yet it seems critical to examine the larger issues, before this 
technology becomes such an integral part of our daily lives that it will be, as was the 
case with commercial television, too late to ask critical questions” (p. 16). Representa- 
tive of much of the questioning in the area are those questions with a futures 
orientation. By gaining a clearer vision of the future, asking critical questions, and es- 
tablishing appropriate policies, we can make better use of this new technology. 

Realizing the educational potential of computer technology largely depends on 
developing appropriate policies relevant to the technology. Dunn (1981, pp. 6-24) es- 
tablishes that historically there has always been a need for information relevant to 
policy making. He also observes that the industrial revolution and more recently the 
advent of a “postindustrial society” have accelerated the need for information rele- 
vant to policy making. 

A recognized means of generating information relevant to policy making is the policy
analysis study. Quade (1975) describes policy analysis as: “any type of analysis
that generates and presents information in such a way as to improve the basis for pol- 
icy makers to exercise their judgment” (p. 4). A commonly used technique in policy 
analysis studies is forecasting. Dunn (1981) suggests that forecasting techniques are 
suited for problems related to uncertainty, especially those that are “messy” or “ill 
structured” (p. 195). This article reports on a study which utilized a forecasting tech- 
nique known as Policy Delphi. 

The purpose of the study was to identify those changes in education, over the 
next five years, which were both probable and possible through the continued intro- 
duction of computers, as well as to identify potential patterns for arriving at a 
normative (desired) future. As previous studies (Barrington, 1981; Sellinger, 1984) 
had reported problems related to the Delphi technique, an additional purpose of the 
study was to further refine the Delphi, particularly in the area of maintaining panelist 
support. The study clarified some of the immediate concerns related to computing in 
education as well as exposed future policy options and issues related to a continued 
introduction of the technology. In addition, it provided a framework for collecting in- 
formation relevant to policy making. 


Methodology 
Yeakey (1983) characterized policy as “the culmination of action and inaction of the 
social system in response to demands made on it” (p. 256). In their definition of policy 
analysis and policy studies Nagel and Neef (1980) emphasize the importance of the 
study of alternative policies: “Policy analysis or policy studies can be broadly defined 
as the study of the nature, causes, and effects of alternative public policies” (p. 15). 
Dror (1971) also recognizes the importance of studying alternative policies in a future 
context: “The establishment of explicit alternative futures assumptions, to serve as 
contexts for concrete policymaking, is an important contribution to the improvement 
of policymaking” (p. 70). Stokey and Zeckhauser (1978, p. 22) represent the study of
alternative policies as similar to an economic model wherein there is a possibility fron- 
tier and decision makers must decide among the alternatives available. On a basic 
level, they describe it as a “fundamental choice model” that involves two primary ele- 
ments: 

1. Identifying the alternatives open to the decision maker; and 

2. Determining his preferences among these alternatives. 


The rationale for the study was derived from these and other aspects of the policy 
literature that relate to establishing policy alternatives. The study first identified 
alternative futures related to computer technology and education and then explored 
policy alternatives for achieving a desired future. Essentially, using definitions pro- 
vided by Dunn (1981, p. 143), it identified those futures that were plausible, potential, 
and normative, and then identified potential patterns for arriving at a normative fu- 
ture. 


The Delphi 

A Delphi is an iterative procedure which allows a group of individuals to collectively 
address complex problems. It is usually future oriented. An essential feature of a 
Policy Delphi, unlike the more common Conventional Delphi, is that it is designed to 
explore policy issues and does not strive for consensus. The study involved three dis- 
tinct iterations or rounds. The first round identified those changes in education which 
were either probable or possible within the next five years as a result of the continued 
influence of computers. The second round involved a rating of the desirability and 
probability of those changes identified as either probable or possible. The third round 
explored potential patterns for achieving a normative future. 

Critical to the success of Delphi studies is the Delphi panel. The Delphi panel was 
identified through a two-step selection procedure. In the first step, six prominent edu- 
cators who represented different organizations were selected on the basis of their in- 
volvement with computers. After participating in the pilot study and the first round 
of the Delphi, these individuals were asked to nominate other participants on the 
basis of supplied criteria. An additional 12 participants were obtained for the study on 
the basis of nominations received from this nucleus. Hence the study panel
consisted of 18 individuals with a knowledge of computers and a background in
education. Participation included representation from central office, government de- 
partments, industry, in-school personnel, and university. A 100% participation rate 
was maintained throughout the three rounds of the study. Cost constraints delimited 
the study to the province of Alberta. However, because at the time of the study Alberta was
one of the leading provinces/states in terms of computer-related acquisition and 
activity (Petruk, 1985), the environment provided a very suitable setting in which to 
identify futures oriented policy issues and concerns. 


Data Collection and Analysis 
Data for the study were collected through a preliminary pilot study followed by a 
three-round Delphi. These data were collected from April 1984 to January 1985. The 
instrument used in each round of the study was developed on the basis of information 
collected in the previous phase. 

The pilot study consisted of six interviews. These interviews were designed to ex- 
plore concerns related to the impact of computers on education and help conceptual- 
ize the problem to be explored throughout the Delphi technique. Information 
obtained in the pilot study was used to develop the Round I instrument.

The purpose of Round I was to identify those developments within education 
which were either probable or for which there was potential as a result of the con- 
tinued introduction of computers to education. Data collection in Round I was by way 
of interview. The interview guide was developed from information obtained in the 
pilot study as well as the literature. The 18 participants were asked to consider educa- 
tional computing and then identify the major plausible and potential futures in the 
following distinct areas: acquisition and funding, courseware, curriculum, organiza- 
tion, and teacher training. In addition to the five areas identified, participants were 
asked to comment on any other areas they felt were important to the study. Data ob- 
tained in the interviews were summarized, categorized, and synthesized in a series of 
nine steps. This process provided 100 statements about future developments in edu- 
cation as a result of the continued introduction of computers. These statements were 
used in the development of the Round II questionnaire. 

The purpose of Round II was to rate the overall probability and desirability of 
statements identified in Round I. The questionnaire consisted of the 100 statements 
about the future which were grouped into the following eight categories: acquisition 
and funding, courseware/software, curriculum, teacher training, organizations, ethi- 
cal concerns, equity, and other areas. Participants were asked to rate the probability 
and desirability of each of the statements. Each questionnaire item required two sepa- 
rate responses wherein the respondent had five choices for each response. The choices 
related to probability were: highly probable, probable, uncertain, improbable, and 
highly improbable. The choices related to desirability were: highly desirable, desir- 
able, uncertain, undesirable, and highly undesirable. Analysis procedures related to 
Round II included reducing these five choices to three categories: positive,
uncertain, and negative. A frequency count was done for each of the items and the
information was cross-tabulated. A 3x3 matrix was produced for each item on the basis of ratings
of probability and desirability. Seven categories were created from the 
cross-tabulation on the basis of participant rating of probability and desirability. 
These categories were: (a) Agreement on probability/no agreement on desirability; 
(b) Agreement on desirability/no agreement on probability; (c) Agreement on proba- 
bility/no agreement on desirability; (d) Agreement on probability/low desirability; 
(e) Agreement regarding low desirability/no agreement on probability; (f) Agreement 
regarding low desirability/improbability; (g) Scattered items. Four of these categories 
were later used in development of the Round III instrument. 
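
The Round II reduction and cross-tabulation described above can be illustrated with a minimal sketch. It assumes each panelist's response to a statement is recorded as a (probability, desirability) pair. The function names and the sample ratings are hypothetical and are not taken from the study.

```python
from collections import Counter

# Five-point response scales as listed for the Round II questionnaire.
PROBABILITY = ["highly probable", "probable", "uncertain",
               "improbable", "highly improbable"]
DESIRABILITY = ["highly desirable", "desirable", "uncertain",
                "undesirable", "highly undesirable"]

# The three reduced categories used for both dimensions.
LEVELS = ("positive", "uncertain", "negative")


def collapse(response, scale):
    """Reduce a five-point response to positive, uncertain, or negative."""
    position = scale.index(response)  # raises ValueError for an unknown response
    if position < 2:
        return "positive"
    if position == 2:
        return "uncertain"
    return "negative"


def cross_tabulate(ratings):
    """Return a 3x3 table of counts for one statement.

    Rows are the reduced probability categories and columns are the
    reduced desirability categories, both in the order given by LEVELS.
    """
    counts = Counter(
        (collapse(p, PROBABILITY), collapse(d, DESIRABILITY))
        for p, d in ratings
    )
    return [[counts[(row, col)] for col in LEVELS] for row in LEVELS]


# Hypothetical ratings from four panelists for one statement (not study data).
example = [
    ("highly probable", "undesirable"),
    ("probable", "highly undesirable"),
    ("probable", "uncertain"),
    ("uncertain", "desirable"),
]
matrix = cross_tabulate(example)
for label, row in zip(LEVELS, matrix):
    print(f"{label:>9}: {row}")
```

In this sketch, a statement whose counts concentrate in the cell for positive probability and negative desirability would be a candidate for the "agreement on probability/low desirability" category. The article does not report the threshold used to decide that agreement existed, so none is assumed here.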

The purpose of Round III was to identify potential patterns for achieving a 
normative future. A questionnaire was again used for this final round. It consisted of 
24 statements which were selected from the original 100 statements used in Round II. 
These statements were selected from categories identified in Round II. Four categor- 
ies were selected on the basis of containing information relevant to the purpose of 
Round III. Each of these four categories was represented in a separate section of the 
questionnaire. Section I included 16 statements which had been regarded as highly 
probable as well as highly undesirable. Participants were asked to recommend strat- 
egies for reducing the probability of these developments. Section II included four 
statements which had been regarded as highly desirable with little agreement regard- 
ing probability. Participants were asked to recommend strategies for increasing the 
probability of these developments. Section III included two statements which had 
been regarded as highly probable with little agreement regarding desirability while 
Section IV included two statements which had been regarded as highly undesirable 
with little agreement regarding probability. Participants were also asked to make 
specific comments regarding each of the statements in Sections III and IV. Data
collected were analyzed through a seven-step procedure which included categorizing,
summarizing, and frequency analysis. In addition, participants were asked to consider 
each of the 24 statements and specify the greatest levels of concern from each of the 
following: provincial, regional, system, school, and university. Frequency analysis was 
used to analyze these data.
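
As an illustration only, the frequency analysis of levels of concern might be tallied as follows. The sketch assumes each panelist names one level per statement. The function name and the sample nominations are hypothetical, not study data.

```python
from collections import Counter

# Levels of concern listed for the Round III questionnaire.
LEVELS_OF_CONCERN = ("provincial", "regional", "system", "school", "university")


def concern_frequencies(nominations):
    """Count how often each level of concern was named for one statement."""
    counts = Counter(nominations)
    unknown = set(counts) - set(LEVELS_OF_CONCERN)
    if unknown:
        raise ValueError(f"unrecognized level(s): {sorted(unknown)}")
    # Report every level, including those nobody named, in a fixed order.
    return {level: counts[level] for level in LEVELS_OF_CONCERN}


# Hypothetical nominations from five panelists for one statement.
nominations = ["school", "system", "school", "provincial", "school"]
frequencies = concern_frequencies(nominations)
print(frequencies)
```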

An additional purpose of the study was a further refinement of the Delphi tech- 
nique. Design features of the study attempted to address problems identified in earli- 
er studies. Particular attention was devoted to addressing the problem of maintaining 
panelist support (Barrington, 1981, p. 265; Bright, 1978, p. 42; Sellinger, 1984, p. iv). A 
100% participation rate was maintained throughout the study. Major design features 
of the study which may have contributed to this unusually high participation
rate include the following: conducting an interview-based preliminary pilot study, use 
of an expanding nucleus nomination procedure to select participants, establishing 
early personal contact with participants through the Round I interviews and main- 
taining that contact throughout the study, procuring panelist support at the com- 
mencement of the study, selective anonymity wherein participants knew the names of 
other panel members while individual responses remained anonymous, field testing 
instruments, limited use of the mail system, and minimal turn-around time. 


Major Findings of the Study 
The study identified a wide variety of developments which were regarded as both 
probable and desirable. These developments were described as being either a reaction 
to technological change or part of a continued technological experiment wherein 
much of the present computer-related activity could be justified on the basis of ex- 
ploiting a future potential. Developments which were identified as probable and un- 
desirable included: underutilization of equipment, problems related to the allocation 
of equipment, lack of appropriate courseware/software, piracy, confidentiality of files 
and information, equity, funding, resistance to change, software and hardware inertia 
restricting upward capability of hardware, migration of human resources, and in- 
creased and conflicting demands on educational organizations. Developments identi- 
fied as desirable with little agreement regarding probability included: development of 
a local software industry tailored to provincial curriculum, a strong emphasis on 
computer-related in-service, an awareness of the social impact of computers, and in- 
creased parent involvement in school matters. Major potential patterns for achieving 
a normative future included: increased courseware/software development and evalua- 
tion, increased levels of funding, inservice, a recognition of the need for policies re- 
lated to educational computing, curricular development, preservice, planning, 
security procedures, and communication among organizations and stakeholders. 
Overall, the Delphi provided useful information for futures oriented planning and 
policy making. The findings tended to support the view that realizing the educational 
potential of computer technology will continue to require a great deal of
activity related to policy making. This activity, while not necessarily uniform, needs 
to occur at a variety of decision making levels. The following conclusions and implica- 
tions are presented as considerations for educational policy makers. 


Conclusions and Implications

Nine major conclusions regarding developments in computer technology and educa- 
tion are presented below. As Rounds II and III of the study were used to refine infor- 
mation collected in the pilot study and Round I, the conclusions are based primarily 
on data collected in Rounds II and III. These conclusions begin with the general and 
move to the more specific. Following each conclusion is a brief discussion of implica- 
tions. An attempt is also made to incorporate aspects of the literature in the discus- 
sion of implications. 

1. Computer technology, like other technological developments, has the poten- 
tial to have positive, negative, or neutral effects on the educational enterprise. The 
nature of these effects will largely be determined by the actions of educational policy 
makers. Policy making relevant to computer technology would be vastly improved if 
we placed more emphasis on determining our educational preferences and priorities.

In addition to identifying benefits, the study identified a number of potential 
negative outcomes related to the impact of computer technology on education. This 
finding suggests that viewing computers as a panacea for all that ails education has 
the potential to do much harm. A critical role for policy makers is to ask questions re- 
lated to the negative effects of computer technology. More importantly, meaningful 
computer-related changes must be founded on substantial values if they are to be of 
lasting worth to education. In this context, computers may be viewed primarily as a 
catalyst, and the major question is not what should we do with computers but rather: 
What should we be doing in education? 

2. There will be a continued commitment to computer technology. This commit- 
ment will be reflected in a continuation of computer-related activities at all grade 
levels. Much of this activity will be justified on the basis of a general technological 
experiment. Policy makers will continue to assume that computers will improve 
educational practice. This improvement will be viewed as taking place within the 
existing educational structure. 

A continued commitment to computer technology will consume a large amount of 
human and financial resources. Despite this commitment, the findings of this study 
and the literature (Bork, 1984, p. 242; Shavelson et al., 1984, p. 30; Sheingold, Kane, & 
Endreweit, 1983, p. 431) express a concern that computer technology is having a lim- 
ited impact on the instructional process. Clearly, a major challenge lies in further ex- 
ploiting the potential of computers in education. Exploiting this potential will involve 
further experimentation, but much of this experimentation will be of little worth un- 
less there is a mechanism for identifying, communicating, and implementing those 
activities which are truly beneficial to education. Also, while experimentation is nec- 
essary, it is inevitable that attempts will be made to use computers for activities for 
which they are inappropriate. In the process of experimentation a critical question 
that educators must ask is one posed by Fullan (1982): “Can rejecting a proposed edu- 
cational program be more progressive than accepting it?” (p. iv). Given the massive 
amount of resources devoted to computer technology, this question should also be 
asked on a broader, macro level. If future computers are truly capable of delivering 
what they promise in terms of ease of human interface, they will be, as Weizenbaum 
(Ebisch, 1984, p. 36) suggests, nearly transparent. If this is the case, and computers do 
indeed become so transparent that we need very little knowledge to actually use them, 
many of the resources presently being devoted to computer technology are not neces- 
sary. There is a possibility that many present-day computer-related activities might 
soon be described as paralleling the Sputnik phenomenon where there was a race to 
train individuals in areas for which society later had very little demand.

3. Technological change is inevitable. A commitment to technology in education 
is a commitment to ongoing change. Change will occur as an ongoing continuous
process. Rapid technological developments will further compound problems related to
change in education. All educational organizations will experience changes
precipitated by computer technology.

While meaningful computer-related change is possible, realizing the potential of 
computers largely depends on developing appropriate strategies for dealing with 
change. “One-shot” acquisition of equipment or “one-shot” inservice will not suffice. 
The rapidly changing technology requires strategies that are ongoing, especially in 
areas related to acquisition, curriculum, and teacher training. In this regard Fullan’s 
(1985, p. 392) notion that change is a process is reinforced by Uhlig (1983) in the area 
of technology: “Technological literacy, unlike some other kinds of literacies, is not an 
event; it is a continuous process” (p. 2). A key issue for policy makers to address in
relation to the “race” with technology is whether we are playing “keep up” or “catch up,” or
whether we should even be in the “race.” Developing strategies to deal with the rapid process of
change presents a mammoth challenge. It is likely that the technological capability to 
produce new and innovative computer devices will continue to be far ahead of our 
ability to use them in education. Somewhat illustrative of this problem are the many 
conferences related to computers in education which often resemble an
industry “show and tell” more than they represent any meaningful dialogue related
to pedagogy. Mumford and Sackman (1975) suggest: “Society should deliberately lead 
and direct the application of computers in the image of its most cherished values and 
ideals rather than be the unwitting victims of the vagaries of technology and the fluc- 
tuations of the market-place” (p. v). What is required from policy makers is a focus 
upon developing a clearer philosophy of technology. 

Resistance to change is often cited as a major problem facing the implementation 
of computer technology. Ironically, the technology itself will foster its own resistance 
to change as it creates problems related to the “saber-tooth” curriculum (Peddiwell, 
1939). For example, sunk costs in software, hardware, or training may mean that edu- 
cational organizations as well as individuals have a vested interest in maintaining cer- 
tain learning environments. Given that much of the recently acquired equipment is of 
an older generation, policy makers should address the following questions: Has much 
of the recent activity, aimed primarily at acquiring large numbers of computers, been 
inappropriate? Have we already installed a base which will be resistant to further 
technological change? Has there been an error in identifying the problem related to 
computers in education as being primarily one of acquisition? 

4. Computer technology will provide the impetus to ask basic pedagogical ques- 
tions related to curriculum and teacher training. There will be continued experi- 
mentation related to the organization and dissemination of knowledge. 

This conclusion suggests that computer technology will provide a powerful impe- 
tus to ask questions fundamental to curriculum, including a reconceptualization of 
both knowledge and the delivery of instruction. Such a reconceptualization will also 
involve a change in the role of the teacher and will be reflected in teacher training pro- 
grams. These developments have the potential to profoundly affect education in the 
long term. 

5. Most changes related to computer technology will be external to the learning 
process. While there will be widespread experimentation, basic curriculum content 
will remain unchanged. Areas such as business education and computing science 
will continue to assimilate the technology. 

There are two major implications, representing opposite scenarios, related to this
conclusion. The first is that meaningful change takes place over time and the next five 
years will continue to be an assimilation period whereby technology follows the path 
of least resistance while at the same time improving existing technologies and prac- 
tices. These activities will continue to be viewed as providing a foundation for realiz- 
ing a much greater potential some time in the distant future. Such an interpretation 
justifies much of the current computer-related activity on the basis of a general tech- 
nological experiment. The second implication of this conclusion is that organizations 
may be avoiding meaningful changes through the assimilation of technology 
(Sheingold, Kane, & Endreweit, 1983, p. 431). If the technology is being assimilated 
and little meaningful change is actually occurring, a tremendous amount of resources 
are currently being wasted. 

Somewhat related to this conclusion is the ongoing implication of attempting to 
cope with the myth and the reality of computer technology. First, expectations often 
far exceed the reality of what the technology can deliver. Second, without adequate 
consideration for the realities of the change process, there is no guarantee that the 
pedagogical applications, which are possible in theory, will automatically find their 
way to educational practice. Decisions that continue to be based on the myth will in- 
evitably lead to disillusionment. 

6. Developments related to computer technology will place increased demands 
on teachers. These demands will be reflected in increased preservice and inservice 
activity. The teacher’s role will gradually begin to change from a disseminator of 
knowledge to a facilitator of learning. 

Charters and Pellegrin (1972) cite as one of the major problems related to educa- 
tional change: “The failure to recognize the severity of role overload among members 
of the instructional staff when innovation is attempted” (p. 12). The findings of the 
study identified computer technology as contributing to role overload among teach- 
ers. Policy makers need first to recognize that this overload exists and then incorpor- 
ate strategies to deal with it when developing policies related to computer technology 
and education. 

7. All educational organizations will be required to make decisions and establish 
policies related to computer technology. Many of these decisions will be made at the 
school, system, and provincial levels. While there is a need for policy at all levels, de- 
cisions made at the school level will have the most significant impact regarding the 
effective utilization of computer technology. Policies will be established both 
through action and inaction inasmuch as organizations that do not respond to the 
changing technological environment will be establishing policies through their 
omission. 

It is of critical importance that policy makers recognize the importance of the in- 
stitutional setting of the school with regard to the use of computers. The findings of 
the study and the literature on change suggest that successful adaptation of an inno- 
vation largely depends on activities at the school level. Berman (1980) provides 
additional insight related to the implications of this conclusion for policy develop- 
ment: “once policy makers dispense with the image that implementation must be uni- 
form for all policy situations, invariable over time, and homogeneous across 
organizational levels, they can search for matching, mixing, and switching strategies
to improve policy performance” (p. 222). 

8. Continued efforts at realizing the potential of computers will create a number 
of issues and problems for policy makers. The study identified the following major 
problems as requiring consideration: (a) underutilization of equipment, (b) allocation
of equipment, (c) lack of appropriate courseware/software, (d) piracy,
(e) confidentiality of files and information, (f) equity, (g) funding, (h) resistance to 
change, (i) software and hardware inertia restricting upward capability of hard- 
ware, (j) migration of human resources, (k) increased and conflicting demands on 
educational organizations. 

Each of the problems is discussed separately below. 


Underutilization of Equipment 

General public support for computers, combined with generous funding allocations,
has resulted in the recent acquisition of a great deal of computing equipment.
Much of this newly acquired equipment will be underutilized. This underutilization 
may occur at two levels. First, machines that have been acquired to take advantage of 
existing grants may see limited physical use. Second, the fact that machines see maximum
physical use does not mean that they are being properly utilized. For example, there
has been concern over the misuse of games and drill-and-practice activities
(Shavelson et al., 1984, p. 31; Sheingold, Kane, & Endreweit, 1983, p. 427). Given the
large amount of resources being devoted to computers in education, there may be a
backlash against computers should the perception increase that
computers are not being properly utilized and that some of the resources could be better
devoted to other activities. Large-scale acquisition and widespread physical use of the 
machines does not guarantee that the potential of computers in education will be real- 
ized. 


Allocation of Equipment 

Computers are being acquired at a rapid rate. They are presently housed in a variety 
of configurations and used at all grade levels. A major issue is how to allocate equip- 
ment in order to maximize benefits from its use. As Peterson (1984) observes, equal 
distribution is fair and simple; however it often does not provide the critical mass nec- 
essary for “ideas and creativity to take off” (pp. 11-12). Configurations such as labs 
may provide a sense of fairness as well as the necessary critical mass; however, they 
have the disadvantage of portraying computers as something separate and apart from 
the curriculum. While increased acquisition will undoubtedly alleviate some of the 
problems related to access it will also create new ones. For example, as newer genera- 
tion equipment is acquired who will get access to it? Administrators? Innovative 
teachers? Designated grade levels or subject levels? Will older generation or 
underutilized equipment be passed on to less than enthusiastic recipients or merely 
placed in storerooms? Policies regarding equipment allocation as well as migration 
will have to be established. 


Lack of Appropriate Courseware/Software 

Both the findings of this study and much of the literature (Alberta Education, 1983, 
pp. 57-58; Bork, 1984, p. 240; Komoski, 1984, p. 247; Rockman, White, & Rampey, 
1983, p. 42; Sheingold, Kane, & Endreweit, 1983, p. 429) strongly suggest that lack of 
appropriate courseware/software is a major area of concern. This problem has been 
magnified by policies allowing the large scale acquisition of computers. Large scale 
acquisition, without adequate consideration for applications, often results in the use 
of inappropriate software programs simply because they are readily available. The 
problem of selecting appropriate courseware/software is further compounded by the 
extremely high ratio of poor quality programs being marketed for use in education. 


Piracy

Lack of appropriate courseware/software, high costs, inadequate funding, the need 
for multiple copies of a program, and inadequate copyright laws will all contribute to 
problems related to piracy. This problem will further reduce the development of qual- 
ity courseware/software when developers do not receive fair compensation for their 
efforts. A further problem, in major conflict with a fundamental goal of the educa- 
tional enterprise, is that students who witness acts of piracy are being encouraged, 
through example, to participate in activities that are dishonest and illegal.


Confidentiality of Files and Information 

The increased reliance on computers, especially in tasks related to administration, 
will create problems related to confidentiality of information. These problems will be 
further compounded by the increased use of computing networks. Two major areas 
that will have to be dealt with are those related to unauthorized access and the estab- 
lishment of policies related to access. 


Equity 

Computer technology has the potential to remove some of the problems related to 
equity. For example, improvements in networking and storage capabilities will allow 
smaller schools to increase their level of service. However, there are also problems as- 
sociated with the equity issue. Two major dimensions of the issue are actual physical 
access to computers, and applications to which students are exposed. Not all school 
districts are adopting computers at a similar rate; consequently, this will lead to prob- 
lems for students who do not have a background in computing, particularly those who 
enter postsecondary institutions. Should problems related to equality of access be 
overcome, it will not necessarily lead to equality of opportunity. Decisions related to 
the allocation and configuration of equipment, as well as how computers are used for 
instruction, will vary within districts, schools, and classrooms. There is also concern 
that boys and individuals from higher social classes are receiving the most benefit from
computer instruction. If computers really do make a difference in education, then pol- 
icies that further address the problem of equity are required. 


Funding 

A substantial amount of funding has already been allocated to educational computing, 
a large portion of which has been designated for the acquisition of equipment. A com- 
mitment to technology requires a further significant ongoing commitment of funds in 
areas such as the acquisition of new equipment, courseware/software development 
and acquisition, curriculum, and teacher training. Related to this issue is the problem 
of obtaining significant levels of funding, accompanied by questions about 
whether it is worth allocating funds for computing that might be better directed to 
other areas. Theoretical speculation about the pedagogical promise of this technology 
is widespread. However, there is comparatively little literature that addresses the 
financial cost of realizing this promise. 


Resistance to Change 

Computer technology, like other innovations, will meet resistance to change. Fullan 
(1985) describes attempts at initiating change as “a complex dilemma-ridden, techni- 
cal, sociopolitical process” (p. 391). There is the constant problem of technology 
having limited impact on schools, or as Sheingold, Kane, and Endreweit (1983) 
concluded in their study, “The study more strikingly illustrates the assimilation of 
technology by school systems than the impact of technology on them” (p. 431). 
Policymakers must be prepared to deal with the complexities of initiating change. 
Fundamental to the development of policy (Berman & McLaughlin, 1976) is that it be 
“concerned with more than the mere adoption of change agent projects,” and take into 
account “the critical significance of the institutional setting” (p. 347). 


Software and Hardware Inertia Restricting Upward Capability of Hardware 

Continued new developments in technology will make further change inevitable. 
Sunk costs in areas such as hardware, courseware/software, and training may act to 
reduce an organization’s capacity to further adapt to change. For example, an organi- 
zation with a large investment in existing hardware and software may resist opportun- 
ities to adapt to new generation equipment which requires a significant departure 
from what already exists. Papert (1979) describes an additional dimension of this 
problem when he portrays existing computer-related practices as sharing “a model of 
education which leads them to reinforce traditional educational structures and thus 
play a reactionary role, opposing the emergence of radically new forms of education” 
(p. 74). Policies are required which reflect the long term nature of computer-related 
change. 


Migration of Human Resources 

Many talented, knowledgeable people in the area of computing will leave education. 
Some of these individuals will pursue careers in business and industry while others 
will simply burn out. Many other successful computing teachers will also migrate to 
administrative positions. Policies are required that address the problem of retaining 
those individuals who can make a positive contribution to education. These policies 
would also address the problem of role overload and burn out. 


Increased and Conflicting Demands on Educational Organizations 

Computers have simultaneously been presented as a panacea for all that ails educa- 
tion and as having limited significant impact. Kohl (Ebisch, 1984) describes them as 
having the potential to reinforce the “worst practices of schools” or become a “tool to 
extend the capacities of one’s mind” (p. 38). They have also been presented as having 
little impact on the instructional process (Bork, 1984, p. 242; Shavelson et al., 1984, 
p. 30; Sheingold, Kane, & Endreweit, 1983, p. 431) yet potentially fostering revolu- 
tionary changes in curriculum (Henchey, 1982, p. 13; Papert, 1980, p. 140) and teacher 
training (Alberta Education, 1983, p. 36; Friedman, 1983, p. 16). Weizenbaum 
(Ebisch, 1984) presents computers as “a powerful distraction that will leave the origi- 
nal problems—money, teachers, time, and energy—untouched” (p. 36). Conflicting 
viewpoints such as these will increasingly be reflected through demands placed on 
educational organizations. This problem will be compounded by rapid changes in the 
actual technology. Policy makers will have to continue to address the role of comput- 
ers in education. 

9. Major potential patterns or strategies for policy makers to consider when 
dealing with the impact of computer technology include the following: increased 
software/courseware development and evaluation, increased levels of funding, in- 
service, a recognition of the need for policies related to educational computing, cur- 
ricular development, preservice, planning, security procedures, and communication 
among organizations and stakeholders. 

Much of the recent activity related to educational computing has been directed at 
acquisition. However, as this technology continues to place increased and conflicting 
demands on educational organizations, policy makers must recognize the need for fur- 
ther policy development. Enhanced communication and dialogue for the purpose of 
establishing policies relevant to technology will become increasingly necessary. 
Rapid change, precipitated largely by technology, makes the development of ongoing 
strategies an essential activity of educational organizations. Realizing the educational 
potential of computers, in addition to requiring strategies that address technical is- 
sues, requires strategies that address people-oriented issues. Developing these strat- 
egies will require a huge amount of resources. 


Contribution to the Study 

Decision makers have always required information relevant to policy making. An im- 
portant aspect of policy making is the establishment of alternative futures to serve as 
contexts for policy making. The rapidly changing technological environment, precipi- 
tated largely by the advent of the computer, has accelerated the need for information 
relevant to policy making. The study provided information relevant to educational 
policy making by clarifying some of the immediate concerns related to computers in 
education as well as exposing future policy options and issues related to a continued 
introduction of the technology. It also illustrated that computer technology is not a 
panacea for all that ails education and is more appropriately viewed as a catalyst for 
basic philosophical questions related to the general role of technology, change, curric- 
ulum, and teacher training. Finally, it emphasized that we have to have a clear vision 
of our educational priorities before we can begin to seriously address questions related 
to what to do with computer technology. 


The relationships between the quality of writing on composition examinations and 
topic selection, outlining, rough drafting, and revision were investigated using a sample 
of 1,372 grade 12 students’ essays. Significant relationships were shown between 
ple of 1,372 grade 12 students’ essays. Significant relationships were shown between 
the essay topic selected from a number of topics and the score obtained, and between 
the use of an outline and rough draft together and the score obtained. Qualitative an- 
alysis of revisions suggests a qualitative difference relative to score for some revision 
categories, particularly stylistic and rhetorical. For every 1,000 words of rough draft, 
students average approximately 70 revisions in their final drafts. Weaker students re- 
vise as much as abler students and across the same revision categories. The purpose of 
all revising undertaken by senior high student writers can be established in one of six 
categories: mechanical, lexical, syntactic, stylistic, figurative, or rhetorical. 


Writing research (Britton, 1975; Christensen, 1967; Hunt, 1965, 1970; Mellon, 1969; 
O’Hare, 1973) and much theory (Britton, 1970; Diederich, 1974; Kinneavy, 1971; 
Moffett, 1968) have tended to concentrate on the writing process from a maturational 
or evaluative rather than a taxonomic perspective; that is, attempts have been made 
to show the differences in writing among various age groups, or to show the types of 
errors students make in their writing. However, the writing process itself has not been 
fully described categorically—it may never be—nor has the classification of the com- 
ponents of the process been completely examined empirically for its message to teach- 
ers of composition. 

In 1975, at the Conference on Describing Writing and Measuring Growth in 
Writing, the following questions were asked (Odell & Cooper, 1977, p. vi): 


1. When we describe writing and measure students’ growth in writing, what should we 
look at? Should we examine only the written product? Should we examine the process 
or strategies which generate that product? In either case, what facets of the product, 
what aspects of the process should we examine when we describe/measure so complex a 
thing as growth in writing? 

2. How can we find out how well an individual performs at a certain stage in his or her 
development as a writer? To ask it another way: How can we determine precisely what a 
writer is doing and not doing so that we can plan efficient, focused instruction? [Italics 
added] 

3. How can we measure growth in writing over a period of time? 

4. How can we involve students in the evaluation of writing? 


Researchers have already made considerable strides toward answering several of 
the above questions. For example, in answer to Question 3, and the first part of Ques- 
tion 2, Hunt, in his research on early and late blooming syntactic structures (Odell & 
Cooper, 1977), has shown that certain syntactic structures do not appear in the writ- 
ing of students before certain age levels. Others (Mellon, 1975; O’Donnell, 1976; 
O’Hare, 1973) have demonstrated that particular instruction, especially in sentence- 
combining techniques, will increase the early use of some syntactic structures signifi- 
cantly. Longitudinal studies to assess the lasting effects of such instruction have yet 
to be conducted, and the entire question of the quality of writing as it relates to the 
use of certain syntactic structures has yet to be fully assessed (cf. Dunn, 1983). 

Odell (Odell & Cooper, 1977) raises the questions: How do we identify the intel- 
lectual processes implicit in students’ writing? How do we determine whether stu- 
dents are using these processes as fully and effectively as they might? He observes 
further that 

If we want to make useful diagnoses or formative evaluations of student writing—that is, if 
descriptions of students’ present writing are to be used in helping them improve subsequent 
writing—we must have some insight into their use of these processes. In making this claim I 
am assuming that: (1) Although thinking is a complex activity, the number of conscious 
mental activities involved in thinking may not be infinite... (2) We can identify linguistic 
cues—special features of the surface structure of written or spoken language—that will 
help us determine what intellectual processes a writer is using. (3) In order to improve 
students’ writing, we will have to determine what intellectual processes we want students to 
begin using, continue using, or use differently; to make this determination, we must have a 
good sense of how they are presently functioning. (p. 108, italics added) 

The research of Hunt (1965, 1970) with regard to when and how students use cer- 
tain syntactic structures has provided some of the insights sought by Odell. The re- 
search in this study, which elaborates further on the writing processes of students, 
examines, in answer to the first two questions posed by Odell and Cooper above, a 
written product, specifically composition examinations at the grade 12 level; it exam- 
ines that product in an attempt to ascertain what writers appear to be doing in start- 
ing and revising their work, that is, in making choices, and in identifying the types of 
choices made. It addresses Odell’s questions: (a) How do we identify the intellectual 
processes implicit in students’ writing? (b) Are there linguistic categories present in 
the surface structure of student compositions that help identify the intellectual pro- 
cesses used by the writers? (c) If there are, will knowledge of these assist teachers in 
improving students’ writing? 

In 1978, the provincial English Written Composition Achievement Test was 
administered to 12,695 grade 12 students in the province of Alberta. The students 
were directed to choose one topic from six provided; space for rough work, suggestions 
for writing, and categories for scoring the finished work were given in the examination 
booklet.¹ The compositions were scored holistically on a four-point scale by marking 
teams composed of Alberta English teachers under the direction of Alberta Education 
and university and school system supervisory personnel. All markers had undergone 
earlier workshop sessions in holistic scoring procedures. 

The research takes its point of departure from a pilot study done by Emig (1971, 
p. 25) who examined the assumption that student writers, first of all, do outline and, 
secondly, that outlining prior to composing results in writing of superior organization. 
Both writing an outline and writing a rough draft have been advocated by teachers as 
the route to quality composition. The Alberta Composition Achievement Test book- 
lets, for example, provide identified space for such work. 

The Emig study, based upon 109 expository themes written by 25 high-honors 
grade 11 English students during an eight-week period, identifies approximately 37% 
of the themes as preplanned in any fashion, and only 8% of the themes as formally 
outlined. Emig found no correlation between the presence or absence of an outline, 
formal or otherwise, and the grades received. She also found, in questioning profes- 
sional and academic writers, that there was “a great diversity and individuality” in 
their writing practices. Do these findings hold true for compositions written under 
test conditions where revision of a rough draft, if not explicitly required, is at least im- 
plied by the space provided? Is there a significant diversity in the revision practices of 
students relative to the quality of their writing? And, ultimately, the question remain- 
ing is: Does writing activity such as outlining and preparation and revision of a rough 
draft result in improved writing or not? Is it a useful activity for some students more 
than for others, or does it matter with some topics more than with others? 


Purpose 

This study reexamines both Emig’s (1971) conclusion with respect to the use of the 
outline and the widely held assumption that writing a rough draft and carrying out ex- 
tensive revisions on it results in superior writing. The changes which occur between 
the rough draft and final composition of meritorious papers, those scored 4, as compared 
to the changes which occur on papers of lesser merit, those scored 1, 2, or 3, are 
analyzed. As opposed to discussion of writing maturity or composition evaluation per 
se, the research is directed primarily toward classification and description, based on 
an examination of some of the characteristics of the product, which might affect the 
process leading to quality composition. The research attempts to further our understanding 
of the writing process in answer to the basic question posed by Odell (Odell 
& Cooper, 1977) cited above: How can we determine precisely what a writer is doing 
and not doing so that we can plan efficient, focused instruction? 

An additional task of this study is to examine the significance of topic selection 
(Table 1) on the quality of the final composition. Also, to accompany the quantitative 
analysis of the above factors, a qualitative analysis is attempted to determine what, if 
any, substantive differences exist in the changes made between rough draft and final 
composition on 4-score papers as compared to those made on an equal subsampling of 
1-, 2-, and 3-score papers. 

In summary, the problems investigated may be stated as follows: 

1. Do student compositions which utilize an outline and/or a rough draft rank 
higher by teacher evaluation than those which do not? 

2. What revision occurs between a rough draft and a final draft on essay tests, for 
what purposes, and does this revision differ substantively with respect to high 
and low scored essays? 

3. Does the selection of a topic from a number of topics affect the type of revisions 
made between a rough draft and a final draft on essay tests, and, if so, how? Does 
topic selection affect the final score, and, if so, how? 


TABLE 1 

Theme Topics 

Topic 1  The most important freedom is the freedom to pursue one’s dreams. 
Topic 2  Modern society treats young people and old people in the same ways. 
Topic 3  Money and material goods are not necessarily the measures of success and 
         happiness. 
Topic 4  Nature in Alberta expresses itself in beautiful extremes. 
Topic 5  Young people should have a greater say in government for it is their future that is 
         being shaped. 
Topic 6  It is a myth that young people are rebellious. They are, in fact, of any group, the 
         most docile and the most resistant to change. 


Method 

Several hypotheses were established against which to test the questions addressed in 
this study: 

1. Where a choice of topics is offered, the topic selected for an essay does not bear a 
significant relationship to the score obtained. 

2. The type and amount of revision between rough draft and finished composition 
are not significantly related to the topic chosen. 

3. Students who employ an outline and/or a rough draft do not score significantly 
higher on their essays than those who do not use an outline and/or a rough draft. 

4. The amount of revision from rough draft to finished product in each revision category 
is not significantly related to the quality of the finished composition. 


To assist in the investigation of these hypotheses, a series of questions was set: 


1. How many papers in the sample were written on each theme and what scores did 
they obtain? 

2. How many papers in the sample use an outline only, formal or informal, and what 
scores did they obtain? 

3. How many papers in the sample use a rough draft only, and what scores did they 
obtain? 

4. How many papers in the sample use both an outline and a rough draft, and what 
scores did they obtain? 

5. How many papers in the sample use neither an outline nor a rough draft, and 
what scores did they obtain? 

6. What are the categories of revision used by students in making changes between 
rough and final drafts? 

7. What is the frequency of revision in each category for each score and each topic? 


As well, three questions were asked relative to the quality of prewriting activities: 


1. Is there a qualitative difference in the information load of the topics offered? 

2. Is there a qualitative difference in the outlining and rough drafting processes 
used by students? 

3. Is there a qualitative difference in the revisions for each category relative to 
score? 

An outline was defined as “any schema related to the composition of the theme, 
prior to the theme (informal) . . .; numbers or letters may precede the items, and there 
may be at least one level of indentation (formal)” (Emig, 1971, pp. 26-27). 

A rough draft was defined as all written material, excluding any outlining, which 
the student produced (and submitted) exclusive of the final copy. This usually in- 
cluded all material on the left-hand side of the examination booklet pages in the 
“rough work” space, as well as any interlinear changes, additions, or deletions that 
might have been made to that rough draft writing during or after its initial composing 
process. Hence “revision changes” to the rough draft are only those differences be- 
tween it and the final copy written on the right-hand side of the booklet in the “fin- 
ished work” space. 

In a very few instances, due either to time constraints or to the student’s intention 
to write only one draft, only one copy was produced either on the left- or right-hand 
side of the booklet. In these cases, since the score is based on the single copy as if it 
were a final draft, the writing was treated as final draft without a rough draft. Word 
counts, based on the number of words in the rough draft, do not include these papers; 
nor, of course, do these papers provide any revision counts. 

The sample was analyzed first for answers to questions one through five above. 
These data are reported in Table 2. A dozen papers, three from each score, were ana- 
lyzed to determine the revision categories which students used in making changes be- 
tween their rough and final drafts. These categories, based on the purpose of the 
revision, were identified as mechanical, lexical, syntactic, stylistic, figurative, and rhe- 
torical. The categories were further refined as the specifics of each were established 
through use by the researcher. For example, if one word replaced another to change 
the meaning, it was coded as a lexical revision. If the same word change constituted a 
synonym replacement to reduce repetition, the revision was categorized as a stylistic 
change. 

In addition to mechanical revisions (Eley, 1953), three categories (lexical, syntac- 
tic, rhetorical), established by Emig (1971), have been utilized. In each case, the origi- 
nal definitions have been adjusted somewhat to accommodate the data. Also, two 
categories, stylistic and figurative, were added to accommodate these reasons for revi- 
sions which were found in the student compositions. It should be noted that the cate- 
gories, while still not mutually exclusive in some instances, have been found adequate 
to accommodate all types of revisions discovered in the sample. Moreover, these cate- 
gories have been found to distinguish the more specific revisions identified as me- 
chanical, lexical, and syntactic from changes which appear to have much more to do 
with quality of thought; that is, stylistic, figurative, and rhetorical. In the former 
instances, revisions more often involve correctness and/or appropriateness of text; in 
the latter, the revisions reflect to a greater extent audience awareness on the part of 
the writer. In this respect, these latter changes frequently require much more rewrit- 
ing than do the former and, it would appear, much more thought on the part of the 
writer about the revision. 

Finally, the six categories are, to some extent, hierarchical. Revisions that might 
be classified as syntactic in nature frequently involve changes in level, specificity, and 
so on as well. In such cases, the revisions are classified according to the “highest” category 
represented by each change and recorded only once. For example, in the rough 
draft, a student wrote 


They also think that it is going to restrict their freedom by having to take care of an old 
person. 

In the final draft, this sentence was changed to 

They also think that it is going to restrict their freedom by having an old person under foot 
all the time. 

The revision is syntactic in that the relationship of the parts is affected, the verbal 
being replaced by a noun. However, the revision is also figurative; the concept has 
been expanded, cliché notwithstanding—in this instance by use of a figure of 
speech—thereby increasing its concreteness. (It might also be argued that a change in 
style has also been effected.) The revision is categorized once as figurative, however, it 
being the “higher” category represented by the change. 
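
The "highest category" rule described above can be sketched in a few lines of Python. This is only an illustration, assuming the six categories rank from mechanical (lowest) to rhetorical (highest) in the order the article names them; the names `HIERARCHY` and `classify_revision` are hypothetical and do not come from the study.

```python
# Assumed ranking, lowest to highest, following the order in which the
# article names the six revision categories.
HIERARCHY = ["mechanical", "lexical", "syntactic",
             "stylistic", "figurative", "rhetorical"]
RANK = {name: position for position, name in enumerate(HIERARCHY)}


def classify_revision(applicable):
    """Return the single highest-ranked category among those that apply.

    `applicable` is any collection of category names that a coder judged
    to fit one revision. The revision is recorded once, under the highest.
    """
    applicable = list(applicable)
    if not applicable:
        raise ValueError("a revision must match at least one category")
    unknown = set(applicable) - set(HIERARCHY)
    if unknown:
        raise ValueError(f"unknown categories: {sorted(unknown)}")
    return max(applicable, key=RANK.__getitem__)


# The "under foot" example: syntactic, arguably stylistic, and figurative.
print(classify_revision({"syntactic", "stylistic", "figurative"}))
```

For the "under foot" revision, the sketch returns the figurative category, which matches the coding decision reported in the text.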

The categories for analysis of revision and their descriptors are listed here in hier- 
archical order: 


1. Mechanical—changes between rough and final draft which involve a correction. 
These changes might include punctuation, spelling, agreement, tense, and capi- 
talization changes; changes from abbreviated or contracted forms to 
nonabbreviated or noncontracted forms; for example, “#” to “number,” “isn’t” to 
“is not.” 

Note 1: not all punctuation changes are mechanical; use of the hyphen, for ex- 
ample, for clarifying modification (e.g., red brick house/red-brick house), or 
punctuation to eliminate fragments or run-on sentences are syntactic revisions. 
Note 2: changes which involve use of recognized acronyms such as NATO are me- 
chanical; however, changes from such abbreviations to their full form in the final 
draft are classified as stylistic. 

Note 3: transcription errors, since they do not indicate a conscious revision made 
by the student upon the text, are not counted. 

2. Lexical—changes between rough and final draft from or to a single word for 
denotation purposes only. 

Note 1: connotative changes are classified as stylistic changes, for example, politi- 
cian/statesman. 

Note 2: level changes are classified as stylistic changes; for example, told/in- 
formed. 

3. Syntactic—changes between rough and final draft, within the sentence only, af- 
fecting the relationship between the syntactic parts; that is, changing a word to a 
phrase, a phrase to a clause, predication, subordination, modification, and coordi- 
nation. 

4. Stylistic—changes between rough and final draft including connotation, voice, 
level, and clause inversion which paraphrase or are optional in nature. 

Note: stylistic revisions might be considered as microrhetorical changes. 

5. Figurative—changes between rough and final draft, which may go beyond the 
sentence boundary, to increase or decrease the specificity, concreteness, 
generality, or abstraction of the text. 
Note: these revisions may be considered an expansion on the original text; that is, 
additional modification of a term which is already modified, or additional sentences 
to expand a concept and thereby increase its specificity and/or concreteness. 

6. Rhetorical—changes between rough and final draft, which almost always go beyond 
the sentence, to manipulate the reader; that is, sentence type changes such 
as declarative to interrogative, beginning new paragraphs, underlining for emphasis, 
reordering sentences, and adding sentences which emphasize rather than 
expand upon a concept. 

A number of questions, based on the category descriptors and examples discovered 
in the compositions, were developed to standardize classification of the revisions 
made between rough and final drafts. These questions are not intended to be exhaus- 
tive of all the possibilities for revisions which might occur between the two drafts, but 
merely to reflect the type of changes assigned to each revision category. 


1. Mechanical—Does the revision: 
a. correct a punctuation error? 
b. correct a spelling error? 
c. correct an agreement error? 
d. correct a tense error? 
e. correct a capitalization error? 
f. transcribe an abbreviation, contraction, or number in full? 
g. reduce a series of words to a common acronym? 

2. Lexical—Does the revision involve a change in the denotative meaning of a word? 

3. Syntactic—Does the revision: 
a. affect the syntactic relationship of the sentence parts? 
b. introduce (or eliminate) modification in cases where the item, before (or 
after) the change, is unmodified? 
c. introduce a word for a phrase or clause, or vice versa? 
d. introduce a phrase for a clause, or vice versa? 
e. subordinate a previously coordinated part of the sentence, or vice versa? 
f. subordinate or coordinate one sentence to another? 
g. punctuate to eliminate sentence fragments and run-ons? 

4. Stylistic—Does the revision: 
a. paraphrase the rough draft version? 
b. offer an optional version of which a judgment of better or worse might be 
made? 
c. switch from active to passive voice, or vice versa? 
d. increase or decrease the stylistic level? 
e. invert text within the sentence? 
f. reduce (or introduce) repetition for effect? 
g. change the auxiliary verb? 
h. replace an acronym with its full form? 
i. involve a shift in connotation only? 
j. switch person, or noun/pronoun? 

5. Figurative—Does the revision: 
a. increase/decrease specificity, concreteness? 
b. increase/decrease modification of previously modified text? 
c. increase/decrease text to expand upon/reduce a concept (thereby increasing/ 
decreasing specificity or concreteness)? 

6. Rhetorical—Does the revision: 
a. manipulate the reader? 
b. switch to the conditional mood? 
c. switch the type of sentence, for example, declarative to interrogative? 
d. reorder sentences? 
e. begin a new paragraph or cancel one? 
f. introduce underlining for emphasis? 
g. add (or subtract) sentences, without expanding (or reducing) the concept, to 
emphasize (or de-emphasize) a point? 

A subsample of 144 papers, consisting of all 4-score papers based on “agreement” 
scores (at least three markers agreed) in the sample (36 papers), and a corresponding 
36 papers from each of the other three scores, were analyzed to provide a count of all 
revisions between rough and final drafts in the six revision categories for the six 
topics. 

The number of words in each rough draft was recorded, and the total number of 
words for the rough drafts in each score was calculated. For purposes of comparison 
between revision categories for each topic and score, and between scores, the total number 
of revisions in each category for each score and topic was converted to revisions per 
1,000 words of rough draft. 
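
The conversion to revisions per 1,000 words of rough draft is a simple rate. The sketch below shows the arithmetic in Python; the function names and the counts in the example are hypothetical and are not taken from the study's data.

```python
def revisions_per_thousand(revision_count, rough_draft_words):
    """Convert a raw revision count to a rate per 1,000 rough-draft words."""
    if rough_draft_words <= 0:
        raise ValueError("rough draft word count must be positive")
    return 1000 * revision_count / rough_draft_words


def rates_by_category(counts, rough_draft_words):
    """Apply the same conversion to every revision category in `counts`."""
    return {category: revisions_per_thousand(n, rough_draft_words)
            for category, n in counts.items()}


# Hypothetical group of papers: 600 rough-draft words in total,
# with 30 mechanical and 12 lexical revisions counted.
example_counts = {"mechanical": 30, "lexical": 12}
print(rates_by_category(example_counts, 600))
```

Because papers without a rough draft contribute neither words nor revisions, they would simply be left out of both totals before the rate is computed.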

Of course, the number of words in the few papers in the subsample without rough 
drafts was not recorded as these papers were considered to be final drafts, but neither 
do these papers provide any revisions to the total. 

An independent coder analyzed 12 papers, three from each score, to determine 
the replicability of revision coding. Product-moment correlation coefficients were calculated 
for each category to determine the degree of intercoder reliability, with the .05 
level of significance considered acceptable. Agreement of 93.3% was achieved overall. 
Product-moment correlation coefficients calculated for each revision category averaged 
0.75, significant at the .01 level (p<.01). 
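
The two reliability measures mentioned here can be illustrated with a short Python sketch. The article does not say exactly how overall agreement was computed, so the sketch assumes it is the share of revisions that both coders placed in the same category, and that the product-moment correlation is taken over per-paper counts within one category. All names and data below are hypothetical.

```python
from math import sqrt


def percent_agreement(codes_a, codes_b):
    """Percentage of revisions that two coders put in the same category."""
    if len(codes_a) != len(codes_b) or not codes_a:
        raise ValueError("both coders must code the same non-empty set")
    matches = sum(a == b for a, b in zip(codes_a, codes_b))
    return 100 * matches / len(codes_a)


def pearson_r(x, y):
    """Product-moment correlation between two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length (at least 2)")
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


# Hypothetical codings of four revisions by two coders.
coder_1 = ["lexical", "stylistic", "mechanical", "rhetorical"]
coder_2 = ["lexical", "stylistic", "mechanical", "figurative"]
print(percent_agreement(coder_1, coder_2))

# Hypothetical per-paper counts of syntactic revisions from each coder.
print(pearson_r([3, 5, 2, 8], [4, 5, 1, 9]))
```

Percent agreement looks at individual coding decisions, while the correlation only checks whether the two coders' counts rise and fall together across papers, so the two figures can differ considerably.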


Findings 

The analysis gave rise to the following findings. 

Relationship of Topic to Score 

Table 3 presents the data relevant to Hypothesis 1: Where a choice of topics is 
offered, the topic selected for an essay does not bear a significant relationship to the 
score obtained. Over a third of all students in the sample opted for topic number 
three. However, the greatest percentage of 4-score papers (11.9%) was written on 
topic number six, with topics number two (9.4%), number five (8.9%), and number 
one (7.6%), all producing a greater percentage of 4-scores than the most popular 
topic, number three. Topic number six also produced the greatest percentage (35.6%) 
of 3-score papers. 

Chi-square analysis of the relationship of topic to score indicated a significant 
(p<.001) overall correlation between topic and score. Hence, the null hypothesis was 
rejected and the choice of topic was deemed to relate to the score obtained. Further 
examination of the contribution of cells to the overall chi-square revealed that topic 
number four, as well as topic number six, appeared to be related positively to score 
(p<.05).
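
The chi-square analysis reported above can be retraced from the counts in Table 3. The sketch below uses no statistics library. The counts are those of Table 3, with any cell that is unclear in print taken from the topic totals (N) and the percentages quoted in the text, which is an assumption. The function name is illustrative.

```python
# Observed papers per topic (rows) at scores 1 to 4 (columns), after Table 3.
# Cells unclear in print are inferred from the topic totals (an assumption).
observed = [
    [35, 192, 77, 25],   # Topic 1, N=329
    [20, 108, 26, 16],   # Topic 2, N=170
    [77, 306, 97, 23],   # Topic 3, N=503
    [43, 117, 44, 6],    # Topic 4, N=210
    [23, 49, 20, 9],     # Topic 5, N=101
    [5, 26, 21, 7],      # Topic 6, N=59
]


def chi_square(table):
    """Return (statistic, degrees of freedom, per-row contributions)."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    contributions = []
    for row, row_total in zip(table, row_totals):
        contribution = 0.0
        for count, col_total in zip(row, col_totals):
            expected = row_total * col_total / grand_total
            contribution += (count - expected) ** 2 / expected
        contributions.append(contribution)
    df = (len(table) - 1) * (len(col_totals) - 1)
    return sum(contributions), df, contributions


statistic, df, per_topic = chi_square(observed)
print(f"chi-square = {statistic:.2f} with {df} df")
for topic, value in enumerate(per_topic, start=1):
    print(f"  topic {topic}: {value:.2f}")

# 37.70 is the tabled chi-square critical value at p = .001 for 15 df.
print("significant at .001:", statistic > 37.70)
```

The per-topic contributions are the quantities listed in the last column of Table 3, so the same routine shows which topics account for most of the overall statistic.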

During the holistic scoring of these essays, teacher-markers were heard to make
such comments as, “If I have to read another ‘beautiful Alberta’ paper, I'll scream!” 
What effect did such a remark have on the validity of the holistic scoring of other 
markers when they next picked up a topic number four paper to score? In fact, very 
few topic number four papers (2.9% as compared to 6.3% overall) received a 4-score 
despite the topic’s being the third most popular student choice of the six offered. 
Topic number three, the most popular choice, received the next least percentage 
TABLE 2

Prewriting Activities Relative to Score

                                                          Score
Topic Choice   Outline/Rough Draft                  1      2      3      4

Topic #1       Neither outline nor rough draft      5     12      3      3
N1=329         Outline only                         1      8      6      1
(23.98%)       Rough draft only                    24    152     46     14
               Both outline and rough draft         5     20     22      7

Topic #2       Neither outline nor rough draft      3      4      1      0
N2=170         Outline only                         1     10      0      0
(12.39%)       Rough draft only                    15     78     21     12
               Both outline and rough draft         1     16      4      4

Topic #3       Neither outline nor rough draft      8     12      1      0
N3=503         Outline only                         7      8      7      3
(36.66%)       Rough draft only                    55    241     66     12
               Both outline and rough draft         7     45     23      8

Topic #4       Neither outline nor rough draft      2      7      1      0
N4=210         Outline only                         2      8      4      1
(15.31%)       Rough draft only                    31     80     24      4
               Both outline and rough draft         8     22     15      1

Topic #5       Neither outline nor rough draft      2      0      0      0
N5=101         Outline only                         2      3      1      1
(7.36%)        Rough draft only                    19     32     11      6
               Both outline and rough draft         0     14      8      2

Topic #6       Neither outline nor rough draft      0      0      1      1
N6=59          Outline only                         0      1      2      2
(4.30%)        Rough draft only                     4     24     13      3
               Both outline and rough draft         1      1      5      1

Totals:                                           203    798    285     86
N=1372                                          14.80% 58.16% 20.77%  6.27%


(4.6%) of 4-scores. Again, it would appear that frequency of choice of topic inversely 
affected markers’ selection of the best papers. This phenomenon is observed through- 
out the six topics, with topic number six, the least popular topic, receiving, propor- 
tionally, the most 4-scores. The frequency polygon (Figure 1) illustrates this 
phenomenon further. By plotting the percentage of students who chose a particular 
topic against the percentage of 4-scores for each topic it is graphically shown that the 
percentage of 4-scores per topic is inversely proportional to the frequency of topic se- 
lection. Topic number four has not been included in the slope calculations because the 
teacher-marker reaction to papers on this particular topic, as discussed above, ap- 
pears to have adversely influenced the number of 4-scores given, thereby contaminat- 
ing the data for this topic. 
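
The relationship plotted in Figure 1 can be approximated numerically. The report does not state how its slope was calculated, so the sketch below assumes an ordinary least-squares line fitted to the five topics that were retained (topic four excluded, as described above). The percentages are those quoted in the text and tables.

```python
# topic: (percent of students choosing the topic, percent of its papers scoring 4)
topic_points = {
    1: (23.98, 7.6),
    2: (12.39, 9.4),
    3: (36.66, 4.6),
    5: (7.36, 8.9),
    6: (4.30, 11.9),
}


def least_squares(points):
    """Fit y = intercept + slope * x by ordinary least squares."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


slope, intercept = least_squares(list(topic_points.values()))
print(f"slope = {slope:.3f}, intercept = {intercept:.2f}")
```

A negative slope indicates that the more frequently a topic was chosen, the smaller the share of its papers that received a 4-score.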

Inspection of the topic statements themselves reveals a number of possible 
inequities among them. For instance, topic number six is the only statement given in 
TABLE 3

Topic Relative to Score

                                       Score
Category (N=1372)              1      2      3      4      χ² (p)

Topic #1  N=329 (23.98%)      35    192     77     25     5.87 (0.15)
Topic #2  N=170 (12.39%)      20    108     26     16     7.04 (0.10)
Topic #3  N=503 (36.66%)      77    306     97     23     3.56 (0.35)
Topic #4  N=210 (15.31%)      43    117     44      6     8.70 (0.05)
Topic #5  N=101 (7.36%)       23     49     20      9     7.15 (0.10)
Topic #6  N=59 (4.30%)         5     26     21      7    12.78 (0.005)

χ² = 45.10 with 15 df, significant at p<.001


two sentences, the second, descriptive sentence offering some elaboration on the first 
declarative sentence. This statement could provide students some additional insight 
into the development of the essay. To a lesser extent, topic number five offers 
additional modification of the basic declarative statement, although the subordinate 
clause is not so much added description as additional declaration. The other topics 
offer only a simple declarative statement in each case. Moreover, topic number four, 
producing the least proportion of 4-scores, is the shortest statement offered of the six, 
appealing perhaps in its brevity and succinctness but, having caught the students’ at- 
tention, offering little additional insight into the direction which the essay might take. 
And, finally, topic number two, the second most productive of 4-scores, suggests, in its 
compound form, an essay development through comparison and parallel structure. 
Hence it may be argued that topics number two, number five, and number six, each 
producing a higher than average percentage of 4-scores, offer an advantage to the stu- 
dents opting in their favor. 


Relationship of Topic to Revision 

Table 4 presents the data obtained relative to Hypothesis 2: The type and amount of 
revision between rough draft and finished composition are not significantly related 
to the topic chosen. Chi-square analysis of the relationship of topic selection relative 
to the frequency of revisions per 1,000 words in each category indicated no significant 
correlation between topic and revisions. Hence the null hypothesis was retained. The 
type and amount of revision was deemed to be unrelated to the topic selected. 


Relationship of Outline and/or Rough Draft to Score 

Table 5 presents the data obtained relevant to Hypothesis 3: Students who employ an 
outline and/or a rough draft do not score significantly higher on their essays than 
those who do not use an outline and/or a rough draft. Over 40% of the students who 
Figure 1: Topic choice relative to score

TABLE 4

Topic Relative to Revision per 1,000 Words

                                  Topic
Category          1        2        3        4        5

Mechanical      14.13    16.19    12.9     17.4     15.02
Lexical          5.47     3.1      4.47     6.64     5.7
Syntactic        6.45     7.26     8.16    12        9.32
Stylistic       16.09    24.41    21.06    25.07    15.28
Figurative      10.49    14.53    14.11    23.00    12.69
Rhetorical       6.32     5.78     6.73     7.67     2.59

χ² = 12.29 with 25 df, nonsignificant at p<.05
used both an outline and a rough draft scored 3 or 4 on their compositions. Conversely,
over 80% of those who used neither an outline nor a rough draft scored 1 or 2
on their compositions. 

Chi-square analysis of the relationship between outline and/or rough draft and 
score indicated a significant correlation (p<.001). Hence, the null hypothesis was re- 
jected, and outlining and rough drafting used together were deemed to correlate posi- 
tively with the score obtained. Further examination of the contribution of cells to the 
overall chi-square revealed that outlining alone does not correlate with the score ob- 
tained, thereby corroborating Emig’s findings. Rough drafting alone was significant at 
the p<.10 level, and, inasmuch as this does not meet the previously set .05 level, H0
was not rejected. However, since some relationship between rough drafting and score 
can be shown with 90% confidence, it would appear that more research may be 
warranted before any definite conclusions concerning rough drafting alone should be 
made, and to confirm that Type II error has not been introduced (that is, that a false 
hypothesis has not been retained). 

Some students (4.8%) wrote neither an outline nor a rough draft, and their marks
reflect this: over 80% of these scored 1 or 2. Some students (5.8%) wrote an outline
only and, again, the majority of these students (65% of those using an outline only)
scored 1 or 2. Where outlining was combined with the writing of a rough draft, a significant
correlation (p<.001) with the score was observed. Forty-two percent of those
students who used both prewriting activities in combination scored 3 or 4 on their
papers, compared to 27% who scored 3 or 4 overall; 9.6% of those who combined both scored
4, compared to 6.3% overall.
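
The percentages quoted above follow directly from the counts in Table 5, as the short sketch below shows. The 3-score count for students who used both an outline and a rough draft is taken as 77, the value implied by the column total of 285; this is an assumption where the printed cell is unclear. The helper name is illustrative.

```python
# Papers at scores 1 to 4 for each prewriting category, after Table 5.
table5 = {
    "neither": [20, 35, 7, 4],
    "outline only": [13, 38, 20, 8],
    "rough draft only": [148, 607, 181, 51],
    "both": [22, 118, 77, 23],  # 77 inferred from the 3-score column total (285)
}


def share(counts, scores):
    """Percentage of the papers in `counts` that received any of `scores`."""
    return 100 * sum(counts[s - 1] for s in scores) / sum(counts)


overall = [sum(column) for column in zip(*table5.values())]

both_high = share(table5["both"], (3, 4))
overall_high = share(overall, (3, 4))
both_top = share(table5["both"], (4,))
overall_top = share(overall, (4,))
neither_low = share(table5["neither"], (1, 2))
outline_low = share(table5["outline only"], (1, 2))

print(f"both, score 3 or 4:         {both_high:.1f}% (overall {overall_high:.1f}%)")
print(f"both, score 4:              {both_top:.1f}% (overall {overall_top:.1f}%)")
print(f"neither, score 1 or 2:      {neither_low:.1f}%")
print(f"outline only, score 1 or 2: {outline_low:.1f}%")
```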

Outlines varied in form from as few as four points jotted down in phrase—or even 
word (noun)—form to elaborate schemata written out in complete sentences, and 
sometimes involving several sentences to each point. However, where rough or final 
drafts developed considerably further on these schemata, despite their elaboration, 
the schemata were acknowledged as outlines and not as rough drafts. 

Some students used numbers, some dash marks, some indentations, or spaces be- 
tween points. Some students started with one method such as numbering and 
switched to another such as dash marks or spacing part way through. Formal 
outlining, such as that taught in school or displayed in some textbooks, was used in 
approximately 5% of the papers. Any schema, whatever its brevity or elaborateness,
formal or informal, which demonstrated some thinking on the topic was counted. Of 
course, the amount of thinking could not be assessed, only guessed at. 

One notable outlining method was used by students who began writing a rough 
draft and switched to point-form outlining part way through. These students some- 
times switched back to complete the rough draft and sometimes the outline ended the 
rough draft stage. It would appear that as ideas, triggered in the composing process of 
the rough draft, began coming too rapidly to write down in complete sentences, the 
outline form offered an expedient alternative. Britton’s phrase, “shaping at the point 
of utterance” (Cooper & Odell, 1978, p. 24), appears especially pertinent in describing
these outlining practices. 

Further evidence of Britton’s observations was noted in outlines which were not 
followed slavishly—or, for that matter, at all—and which varied in length and degree 
of completeness. Where time is a factor, such as on a composition examination, 
outlining often appears to serve the purpose of starting the thinking process on the 
topic, and it is abandoned at some point where ideas flow freely and time is more 
profitably spent writing them down in complete prose form. When ideas begin to “dry 
up,” then reversion to outline form in the middle of the rough draft may also 
restimulate the flow (rather than accommodate too rapid a flow). 
TABLE 5

Outline/Rough Draft Relative to Score

                                               Score
Category (N=1372)                      1      2      3      4      χ² (p)

Neither outline nor rough draft       20     35      7      4    14.30 (.005)
Outline only                          13     38     20      8     4.20 (.25)
Rough draft only                     148    607    181     51     6.65 (.10)
Both outline and rough draft          22    118     77     23    27.48 (.001)

χ² = 52.63 with 9 df, significant at p<.001


A few students began with outlines which became more and more elaborate as 
they progressed until finally they became a rough draft. Others outlined their essays 
completely, but developed their rough drafts only partially before beginning the final 
draft. Very occasionally, an outline, although on topic, was not followed sequentially 
at all when the rough or final draft was written. This practice suggests, in these few 
instances, a thinking process which is on the topic but which is not using the outline as 
an organizational process for the essay. 

Most students (89.4%) in the sample wrote rough drafts of their essays. 
Seventy-two percent of the students wrote only a rough draft (no outline) prior to 
their final draft. Of these, 76.5% scored 1 or 2 on their papers (compared to 73% who
scored 1 or 2 overall—5.2% scored 4 compared to 6.3% overall). As with the use of an 
outline alone, it appears that the writing of a rough draft alone is unrelated to the 
score obtained. An unimpressive chi-square of 6.65 with 3 df (at a level of significance 
of only .10) would suggest as much. However, further research into the relationship 
between score and rough draft may be warranted. 

Rough drafting practices, like outlining practices, varied among papers. Fre- 
quently, students did not finish the rough draft prior to completing their final draft. 
Sometimes the rough draft was developed half or two-thirds of the way to completion 
and left at that point. Most frequently, a final paragraph which did not appear in the 
rough draft was added to the final draft. On a timed (two-hour) composition examina- 
tion, this practice may be explained in part by the pressure to move on quickly to the 
final draft. (One hundred forty-five students, of the 1,372 in the sample, wrote only 
one draft which, for marking purposes, was considered to be the final draft.) However, 
as with outlining, rough drafting appeared to be carried out to a point where writers 
discovered the total content and organization of their essays, and beyond which there 
was no need to continue. At this point, the need to complete the rough drafts was 
obviated, particularly in light of time constraints, and the writers moved on to the 
final draft stage. 

On a very few papers, the rough draft was not adhered to organizationally; that is, 
although the final draft dealt with the same topic generally, the sequence of ideas and, 
to some extent, the treatment were very different from that of the rough draft. As with 
the outline, this anomaly suggests that a very few students used the rough draft stage 
to think about the topic but not to organize or structure it in a form they would 
ultimately use in their final draft. For them, the organizational stage was reserved 
until the final draft was being written. Conversely, a very few papers were virtually 
nent of the writing process, either had not been learned, or was not evident due to revision
changes being made on the rough draft prior to transcription. The latter,
however, did not appear to be the case from an inspection of their rough drafts. 


Relationship of Revision to Score 

Table 6 presents the data obtained relevant to Hypothesis 4: The amount of revision 
from rough draft to finished product in each revision category is not significantly re- 
lated to the quality of the finished composition. A histogram (Figure 2) and a fre- 
quency polygon (Figure 3) provide pictorial representations of these data. The graphs 
show that the frequency of stylistic revision per 1,000 words of rough draft is greatest 
for all scores. The frequency of figurative revisions on 2- and 3-score papers ranks sec- 
ond along with the frequency of mechanical revisions on 1- and 4-score papers. 
Syntactic, rhetorical, and lexical revisions are made with decreasing frequency at all 
score levels. 

Chi-square analysis of revision categories relative to score indicated no significant 
correlation between the frequency of revisions per 1,000 words in each category and 
the score obtained. Hence the null hypothesis was retained. 

The amount and type of revision carried out by students while composing on an 
essay examination are unrelated to the quality of the final essay; that is, the revision 
changes made between rough and final drafts on 4-score papers did not differ signifi- 
cantly in number from those on 1-score papers. Despite this, the actual amount and 
type of revising that students undertake are notable. For every 1,000 words of writing, 
approximately 70 changes are made between rough and final drafts. These changes 
may be coded into six quite different categories which are used by students irrespec- 
tive of the quality of the paper produced: mechanical, lexical, syntactic, stylistic, fig- 
urative, and rhetorical. For every 1,000 words of written first or rough draft, students 
make approximately 15 mechanical revisions, 5 lexical revisions, 8 or 9 syntactic revisions,
21 stylistic revisions, 14 figurative revisions, and 7 rhetorical revisions.
Twenty-seven pages of revision examples and detailed discussion, somewhat after the 
fashion of Richards in Practical Criticism (1929), are reproduced in the complete re- 
search report, but are omitted here in the interest of space. 
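
The approximate rates quoted above are consistent with averaging the Table 6 row totals over the four score levels. The sketch below assumes an unweighted mean, which the report does not state explicitly. The figurative row total is read as 57.12, the value implied by the grand total of 279.67.

```python
# Row totals from Table 6: revisions per 1,000 words summed over scores 1 to 4.
row_totals = {
    "mechanical": 58.54,
    "lexical": 18.69,
    "syntactic": 34.05,
    "stylistic": 83.81,
    "figurative": 57.12,  # read from the grand total of 279.67 (an assumption)
    "rhetorical": 27.46,
}
SCORE_LEVELS = 4

# Unweighted mean rate per category across the four score levels.
mean_rate = {category: total / SCORE_LEVELS for category, total in row_totals.items()}
overall_rate = sum(mean_rate.values())

for category, rate in mean_rate.items():
    print(f"{category:<11}{rate:6.2f} per 1,000 words")
print(f"all categories: {overall_rate:.1f} per 1,000 words")
```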

One-score papers exhibit, to a more noticeable extent than those at other score
levels, revisions that involve complete changes, or apparent changes, of meaning; that
is, concepts are rewritten from rough to final draft into longer, often more convoluted
phraseology involving confusing mental leaps. Such revisions are usually coded as figurative or
rhetorical. Also, these students frequently have very little to say on the topic and
tend to say it in as many ways as possible, making revisions of repetition (rhetorical
and stylistic) to fill out their text; they hope, no doubt, to impress by quantity as
opposed to quality of text.

Two-score papers exhibited one revision characteristic seldom found in 1-score 
papers, that of a changed or sometimes alternating point of view. Rough drafts written 
in the first person were revised in the final draft to second or third person point of 
view. Those in the third person initially might change to first or second in the final 
draft. Usually this revision was not consistently maintained throughout the essay. 
These revisions, unless contaminated by associated figurative or rhetorical revisions, 
were coded as stylistic. Also evident at this score level was a continuing tendency to
change meaning, or appear to change meaning, between rough and final drafts, coded
as figurative.
TABLE 6

Revision Changes per 1,000 Words Relative to Score

                              Score
Category          1        2        3        4       Total

Mechanical      15.15    18.04    11.70    13.65     58.54
Lexical          3.26     4.79     5.56     5.08     18.69
Syntactic       11.65     9.13     6.67     6.6      34.05
Stylistic       22.79    21.79    16.79    22.44     83.81
Figurative      12.77    18.4     14.16    11.79     57.12
Rhetorical       9.27     5.08     6.79     6.32     27.46
Total           74.89    77.23    61.67    65.88    279.67

χ² = 5.94 with 15 df, nonsignificant at p<.05


Figure 2: Changes per 1,000 words by revision category (mechanical, lexical, syntactic, stylistic, figurative, rhetorical) for each score

Figure 3: Revision Relative to Score

Although there is a similar frequency of revision in each category for each score,
the nature of these revisions appears to be changing at the 3-score level. Complete 
changes or apparent changes of meaning between rough and final draft, although still 
found, are not quite so prevalent. Better students have more to say on a topic—aver- 
age word counts in the subsample increase with score: 258, 400, 488, and 554 words, re- 
spectively—and, consequently, students seem to grope less for what to say and, 
perhaps, more for how to say it at this level. Fewer 3-score revisions actually weaken 
the final product. And point-of-view shifts occur less often as students at this level 
tend to favor the third person. Associated with these characteristics is an increase in 
the amount of revision to which a single component of text might be subjected. 

Papers at the 4-score level demonstrate further the tendency of abler students to 
be concerned with how to express a concept as opposed to what concept to express. 
Again, better students have more to say, 4-score papers averaging more than twice the 
number of words of 1-score papers, and they demonstrate an increased interest in how 
to say it appropriately. Lexical revision is noticeable in these papers, as is stylistic re- 
vision. Several changes, as opposed to only one or two at lower scores, may occur in a 
single component of text, and these may be more difficult to categorize. The quality of 
revision, consistent with the intellectual abilities of 4-score students, is also notice- 
ably superior to that found in papers of lower scores. It should be recalled that only 86 
papers, 6.27% of the sample, received a 4-score grading in a total of 1372 papers in the 
sample. 

Lexical revision in 4-score papers frequently incorporates other categories of revi- 
sion as well in the same component of text. If the revisions are inextricably bound, 
they are coded as one change, the “highest.” If the revisions are identifiably separate, 
they are coded separately. Very rarely, a revision will not be coded in the “highest” 
category present because of some strong argument otherwise. This decision is usually 
based on what the writer appeared to be attempting to accomplish rather than on 
what was in fact achieved. The sample revisions and discussions in the complete re- 
port (Gee, 1984) exemplify these observations extensively. 


Conclusions 

It would appear that topics cannot be selected, despite the care and rigor exercised in 
their selection, so that they are of equal difficulty or quality. The structuring of the 
topic statements themselves, and the differences in that structuring, appear to influ- 
ence the development of an essay. It also appears that, where several topics are made 
available on an essay examination, the students who opt for the least popular topics 
tend to make the highest grades or scores. Markers may be biased against a topic or become
bored when marking the same topic over and over again, or more capable students
may opt for one or two particular topics. Even the order of topics may influence their
equality.

Students outline essays and write rough drafts in many ways. A definite correla- 
tion exists between essay quality and the use of an outline and rough draft together. 
Few students outline as formally as is taught in school. Some mix outlining and rough 
drafting. A common practice in both outlining and rough drafting is to discontinue the 
prewriting activity as soon as ideas and structure become obvious and time is better 
spent on the final draft. As well, outline form may be used to cope with ideas which are 
coming too fast in the midst of the development of a rough draft. Conversely, 
outlining, or point form listing, may restimulate the flow of ideas if they cease or slow 
during rough drafting. Finally, a few students appear to think about the essay topic by 
using an outline or rough draft, but not to structure their final essay based on these
devices. Essay organization for these students appears to be separate from concep- 
tualization of the topic and reserved for writing of the final draft. 

Although the amount of revision carried out by students on their rough drafts is 
similar despite the quality of the final essay produced (or the ability of the students), 
the quality of revision does appear to vary with student ability. Students undertake a 
surprising amount of revision on essay examinations. Moreover, all students use all six 
types of revision. Weaker students, apparently having much less to say on a chosen 
topic, tend to revise so as to expand their text by repetition. Consequently, they tend 
to produce more rhetorical and stylistic revisions of this nature than do students of 
greater ability. Also, weaker students tend to have more problems with what to say as 
opposed to how to say it, and, as a result, their revisions frequently alter meaning 
completely. These revisions are characterized by confusing structural changes. 

Abler students tend to have more to say about a topic and appear to concern 
themselves more with how to say it. Revisions which result in meaning change are less 
apparent. Concern for the right word becomes noticeable, resulting in an increasing 
number of lexical revisions. More revision of a single component of text is also appar- 
ent at the 3- and 4-score levels. This concern for how a concept might appear in print 
results in many stylistic revisions. However, these stylistic revisions fulfill a different 
purpose than those used by weaker students who revise for mere repetition to add 
quantity of text. 

Some students revise to change the point of view. Concern for this kind of revision 
is not overly apparent in 1-score papers. But at the 2-score level, an increasing aware- 
ness of this stylistic consideration appears to be occurring. Control of person appears 
to offer difficulty for these students. By the 3-score level, students tend to use third 
person at the outset in their rough drafts, resulting in fewer revisions of this nature in 
the final draft. Finally, as would be expected, revisions which tend to weaken rather 
than improve a passage appear to taper off as the quality of papers increases. 

In summary, more questions are generated than are answered in a study such as 
this. Research into the composing process is still in its infancy. However, research into 
the writing product, as well as study of the writing process, can reveal much about the 
process and will do so in a fashion which may be controlled to the extent necessary for 
valid, reliable research. Much remains to be investigated: Should a choice of topics be 
given on composition examinations? Will the same level of syntax among topics result 
in the same level of difficulty? Should one topic only be given, or should a scenario, a 
picture, or music be the stimulus for tests of writing ability? Should the audience be 
identified? The purpose? The form? Can an effective outlining and rough drafting 
process be taught? Can the qualitative differences between revisions of weaker and 
more able students be identified? Can the revision process be taught more effectively 
than it now is? This much is known: 

1. Where a number of topics is offered for examining writing ability, the topic select- 
ed bears a significant relationship to the score obtained. 

2. The type and amount of revision carried out on an essay is not related to the topic 
chosen. 

3. Students who employ an outline and a rough draft together score significantly 
higher on their essays than those who do not. 

4. The amount of revision which occurs between the rough and final draft of an 
essay in an examination setting is not related to the quality of the essay obtained, 
although the quality of the revision may be. 
The fact that all students do much more revision of their writing in an examination
setting than was initially suspected is surprising in itself. That they revise for six dis- 
tinct purposes, despite their abilities, is remarkable. The need now is to determine 
how that revision might be directed toward the production of higher quality writing 
from all students. 


Note 
1. Appendices containing a sample booklet, a coding card, and raw data for each score are available on request from the
author.
As Some Children Write
More Than Others: 
The Impact of Learning to Write 


This study compares the impact of two pervasive instructional approaches on the attitude
toward and understanding of writing among grade 1 students. One approach is
based on an introduction to the skills of writing and the other sets out to treat the students
as writers by having them write. The sample consisted of six classes (N=109)
split between the two approaches. The students were interviewed on four aspects of 
writing—function, attitude, self-concept, and vocabulary—initially in October and 
then again in May of the same school year. An analysis of the change over the course of 
the year revealed a significant gain in the technical vocabulary of literacy for the stu- 
dents who learned by writing, but not in their attitude or in their self-concept as 
writers. 


This has proved a promising decade for writing instruction. From the work of Graves 
(1983; Hansen & Graves, 1986) on the east coast to the Bay Area Writing Project 
(Myers & Gray, 1983) on the west, strong programs have emerged backed by 
outspoken advocates. Simply put, these programs would have students write from the 
very beginning as writers. Recent research into the process and acquisition of writing 
suggests that little else may be needed (Harste, Woodward, & Burke, 1984; Smith, 
1982). Rather than waiting until the skills to write are in place, students in the first 
grade are encouraged to find their way into print and publishing without going 
through the workbook approach to prove and practice their readiness. Thus far the re- 
ports on this new method have been principally testimonial and ethnographic, with 
enthusiastic results in both cases. What has been repeatedly found is that the stu- 
dents are writing marvelous things and that they are proud of what they write (e.g., 
Silvers, 1986; Willinsky, 1984). The claims are supported by impressive instances of 
the students’ work. The celebration of this work is underwritten by a sense both of the 
improved performance of the children and their improved attitude toward writing. 
The question which this study takes up on a very modest scale is whether we can as- 
certain more precisely the differences this writerly introduction into literacy in the 
first grade might make when compared with the experience students receive in what
might be contrasted as a skill-sequenced approach. 

A review of the strong but often implicit claims made by those who advocated 
learning by writing (the expressive-writing approach) suggested that there should be 
a measurable improvement in three areas: (a) the students’ understanding of the 
function of writing, (b) their attitude toward writing, and (c) their regard for themselves
as writers. On the other hand, the literature also suggested that the traditional,
or skill-sequenced, approach to writing through preparatory steps would produce a
stronger sense of what a word or a sentence is.

In order to appreciate the study which follows, it may be helpful to review the 
basis of those claims. The expressive-writing approach, for instance, is designed to ad- 
dress the limited range of functions writing serves in the schools (Rosen & Rosen, 
1973). Britton’s scheme of expressive, transactional, and poetic modes has been used 
to map out the limited nature of students’ written work in school; Halliday’s taxon- 
omy of seven language functions is often cited to suggest the missing elements. 
Holdaway (1979), for example, cites Halliday in declaring that we should be exploiting 
what he terms writing’s “functional richness” rather than maintaining in the class- 
room the more common “narrow range of purposes” (p. 14). As the call in this ap- 
proach is for students to write as writers, these student-writers are to grasp the real 
world of writing and the full range of its usefulness (Baker, 1984). Thus we might ex- 
pect that young students will learn more through this experience of (a) what a writer 
does, (b) who writes, and (c) what writing is good for. Accordingly, it was decided to 
ask the students of the grade 1 class about these three points in order to ascertain 
their sense of writing function. 

On the second item of inquiry—importance of the students’ attitude toward liter- 
acy—Johns (1974), for one, found that a key difference between good and poor read- 
ers was that good readers saw reading as a meaningful experience with a significance 
outside of the classroom. It seemed appropriate to borrow from Johns’ work on read- 
ing attitudes to explore a comparable aspect in the student’s appreciation of writing’s 
purpose. A particularly crucial point in the attack on the skill-sequenced program
led by such figures as Graves and Holdaway has been the very shortage of meaning
which occurs in its emphasis on the skills of letter identification and decoding. We 
might expect that, spared this process in favor of writing out their own experiences, of 
acquiring ownership over their own work, the expressive-writing students’ attitude 
toward writing should appreciably improve as should their sense of its role or function 
outside of the school. 

Self-concept was the third area of inquiry and an old strand in progressive pro- 
grams. As the students realize a new written voice, they should discover a new sense of 
themselves as writers. Since the turn of the century, the progressive movement has 
advocated the psychological benefits of creative writing (Brand, 1980). Over this time 
writing has been viewed as a means of unfolding the child’s personality, providing 
self-confirmation and a sense of dignity; as the child wrote in this fashion, progres- 
sives claimed, “the more a person he became” (Suffinsky, 1933). Writing was a matter 
of “the whole problem of individual identity and how it develops” (Holbrook, 1968, 
p. 1). We were “writing to create ourselves,” as one recent writing text claims on behalf 
of students (Allen, 1982) or, to return to Graves, “the child’s marks say ‘I am’” (1983, 
p. 3). Finally, on a less existential note and yet with more hard research to support it, a 
positive sense of self has proven to be related to a success in literacy which extends to 
academic performance in general (Brookover, Thomas, & Patterson, 1964; Campbell, 
1965). 

On the skill-sequenced side, a sensitivity to the units of written language in developing 
literacy has long been an important part of the study of reading readiness 
(Evancheko, Ollila, Downing, & Braun, 1973). A writing vocabulary, from having a 
sense of sound-letter correspondence to acquiring the elements of a technical 
vocabulary of literacy, would seem to be crucial in the decoding process as it concen- 
trates on taking meaning from the nature of individual elements and not from 
contexts; this isolation of elements receives a corresponding emphasis in the students’ 
work in the skill-sequenced classrooms through worksheets: “copy these words over”; 
“what letter does this word start with”; “complete this sentence.” With this focus, 
these students were expected to achieve more in the skills of word and sentence iden- 
tification. In the expressive-writing classroom, this technical vocabulary of literacy 
arises, if at all, in an incidental manner, through the actual use of such elements as 
words and sentences (Graves, 1983). 


Method 

This study used a pretest-posttest design to ascertain changes in the students’ regard 
for writing in a manner which could be measured and compared under two different 
programs of instruction. The test instrument consisted of 22 questions which were in- 
dividually administered to the grade 1 students in a 10-minute interview which was 
conducted in October and then again in May of the same school year (Table 3). The 
questions covered the four areas described above: writing function, writing attitude, 
self-concept, and technical vocabulary. The test instrument used with the students 
contained a number of items, some borrowed and some new. In the areas of vocabulary and writing 
function, the items came from the methods of Downing and Oliver (1973-1974) for 
testing the identification of “words”; Calkins (1980) for punctuation; and Mayfield 
(1983) for writing function. The groundwork in assessing writing attitudes and self- 
concept as a writer had not been so thoroughly laid. To measure the changes in the 
students’ regard for themselves over the course of grade 1 both direct questions—“Are 
you a writer?”—and more subtle means were employed. After examining the meager 
resources for assessing self-concept in young schoolchildren, it was decided to use a 
variation on the picture self-identification technique. Seven cartoons were developed 
in male and female versions which showed a fairly uniform set of characters in a num- 
ber of activities and situations from the classroom to the doctor’s office. The students 
were asked to select one figure in each picture who they would be “if they had to be one 
of the people in the picture.” The characters had been designed to represent either an 
active or passive pose, with the student’s selection addressing the question of whether 
an active, developed sense of self was encouraged by fostering the student’s personal 
voice on the page. 

The pictures were initially tested with a class of students and the reasons for their 
choices were used to modify the cartoons, removing what appeared to be extraneous 
and distracting elements. With the pictures adjusted, the entire instrument was 
piloted with a class of 20 students after which further refinements were made to the 
ordering and wording of the questions. The interviews with the students were held in 
the school library and began with the cartoons as these were found in the pilot to put 
the children at their ease. 

In terms of scoring the results of the individual interviews, the questions were de- 
signed to produce four simple measures: for Writing Function, the students were 
credited with the number of instances given to such inquiries as “what can be done 
with writing?” Writing Attitude was scored by the number of items given to such 
questions as “what are all the good/bad things about writing?” arriving at a positive/negative 
score for the student’s attitude; Self-Concept was also a measure of positive 
responses to the inquiries or “active” choices with the pictures; and Technical 
Vocabulary was a matter of adding up the correct responses. To analyze the impact of 
the two programs, t-tests were run on the differences in raw scores between the 
October and May interviews to establish the significance of the change over the year 
in their understanding and regard for these different aspects of literacy. 
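
As a rough illustration of this analysis, the sketch below computes October-to-May gain scores and compares the gains of two groups with a pooled-variance t-test. The scores are invented for the example and the function names are hypothetical. Treating the comparison as an independent-samples test is also an assumption: it is consistent with the df = 107 reported in Table 2 for N = 109, but the study does not name the exact procedure.

```python
import math
from statistics import mean, variance


def gain_scores(pretest, posttest):
    """Return posttest minus pretest for each student (same order in both lists)."""
    if len(pretest) != len(posttest):
        raise ValueError("pretest and posttest must have the same length")
    return [post - pre for pre, post in zip(pretest, posttest)]


def independent_t(group_a, group_b):
    """Pooled-variance t statistic for two independent groups; returns (t, df)."""
    n_a, n_b = len(group_a), len(group_b)
    df = n_a + n_b - 2
    pooled_var = ((n_a - 1) * variance(group_a) + (n_b - 1) * variance(group_b)) / df
    std_error = math.sqrt(pooled_var * (1 / n_a + 1 / n_b))
    return (mean(group_a) - mean(group_b)) / std_error, df


# Hypothetical raw scores for four students per program (not data from the study).
skill_gain = gain_scores([3, 4, 2, 5], [4, 5, 4, 6])
expressive_gain = gain_scores([3, 3, 4, 2], [6, 5, 8, 5])

t_value, df = independent_t(expressive_gain, skill_gain)
print(f"t({df}) = {t_value:.2f}")
```

With the class sizes in Table 1 (57 and 52 students), the same function would give df = 57 + 52 - 2 = 107.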

The sample for this study consisted of six grade 1 classrooms (N = 109). The classrooms 
were picked on the advice of the language arts coordinator for the district who 
was asked to recommend three teachers clearly using an expressive-writing approach 
and three clearly using a skill-sequence approach; this selection process was further 
refined by ensuring that the teachers were in schools which were roughly equivalent in 
the socioeconomic level of the neighborhoods. The actual writing programs used in 
the classrooms were monitored by collecting daily samples of the work of six randomly 
chosen students from each of the six classes over the course of eight two-week periods 
during the same year. These writing samples demonstrated that, although there was a 
clear distinction in writing instruction between the two programs, the classrooms 
were neither ideal nor pure instances of either approach. 

As can be noted in Table 1, a few workbook pages did turn up in the samples col- 
lected from the expressive-writing classrooms and the odd narrative was written in 
the skill-sequenced classroom, which is to say that the study was conducted in typical 
classrooms in the hands of typical teachers. The degree of implementation is repre- 
sentative of what we might expect in the filtration and adaptation of ideas throughout 
the school system. The actual measure of the differences in the nature of the writing 
of the students in the two programs suggests that in a typical week in the skill-sequence 
classrooms the students did about three skill sheets, compared to less than one on 
average for the expressive-writing classrooms; in the amount of sustained writing, the 
expressive-writing students wrote two and one-half pages compared to a little over half 
a page for the skill-sequenced students. Judging by what the students actually pro- 
duced, it seems clear that the language arts consultant provided a relatively accurate 
reading of the differences in program. 


Results 

Of the four composite measures in which we might have expected a sizable improve- 
ment of one program over another, only in the technical vocabulary of literacy did the 
expressive-writing classes make significantly greater gains than the skill-sequenced 
classrooms (Table 2). To check the relationship among the four composite measures, a 
matrix of Pearson product-moment correlation coefficients was calculated for all of the 
pairs. The highest correlation was .26, between writing function and self-concept. 
Because gender has proven to be an important influence in children’s response to 
schooling, the gains were also tested for significant differences by 
gender, and none were found in the four composites. Finally, an analysis of variance 
was used to determine the interaction effect between program and gender; there was 
no significant interaction on the composite measures. 
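
The correlation check described above can be sketched as follows. The gain scores are invented for illustration, and the names `pearson_r` and `correlation_matrix` are hypothetical helpers, not part of the study's procedure.

```python
import math


def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    s_xy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    s_xx = sum((a - mean_x) ** 2 for a in x)
    s_yy = sum((b - mean_y) ** 2 for b in y)
    return s_xy / math.sqrt(s_xx * s_yy)


def correlation_matrix(measures):
    """Correlate every distinct pair of measures; keys are (name_a, name_b)."""
    names = list(measures)
    return {
        (a, b): pearson_r(measures[a], measures[b])
        for i, a in enumerate(names)
        for b in names[i + 1:]
    }


# Hypothetical gain scores for four students on the four composite measures.
gains = {
    "function": [1, 2, 3, 4],
    "attitude": [2, 1, 4, 3],
    "self_concept": [4, 3, 2, 1],
    "vocabulary": [1, 3, 2, 4],
}

pairs = correlation_matrix(gains)
strongest = max(pairs, key=lambda pair: abs(pairs[pair]))
print(strongest, round(pairs[strongest], 2))
```

Four measures yield six distinct pairs, which is the size of the matrix the text refers to.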

The students’ grasp of the function of writing (of who writes, what an author does, 
what can be done with writing, and what do you make stories about) showed a gain in 
both programs which did not differ significantly in the composite scores. But on one of 
the individual items—“what can be done with writing?”—the expressive-writing class 
was able to name a significantly larger number of functions for writing over the course 
of the year (p< .05), with the skill-sequenced class coming up with fewer functions in 
the final interview than they had offered eight months earlier. 

TABLE 1 
Average Number of Pages per Student in a Week by Type of Writing 

                                      Type of Writing (in pages) 
Group                          N      Skill(a)      Expressive(b) 
Skill-Sequenced Students       57     3.18          .59 
Expressive-Writing Students    52     [illegible]   2.69 

(a) Skill-based writing included work on phonetics, letter formation, and sentence completion. 
(b) Expressive writing demonstrated student control of units of at least a sentence in length. 


TABLE 2 
t-Tests of Pretest to Posttest Gains on Four Composite Measures (N = 109, df = 107) 

                       Skill-Sequence            Expressive Writing 
Item                   Mean       SD             Mean          SD       t             p 
Writing Function       .1105      [illegible]    1.2885        5.374    [illegible]   [illegible] 
Writing Attitude       .1075      2.232          [illegible]   2.107    1.47          .144 
Self-Concept           10.5789    3.775          9.8269        2.988    1.15          .204 
Writing Vocabulary     2.3508     4.042          5.0557        3.528    [illegible]   .001 


In the attitude measures, the expressive-writing class’s positive responses to the 
activity of writing declined over the course of the year while the skill-sequenced class 
did not grow appreciably more positive toward writing. This decline in the regard for 
writing among the students working in the expressive-writing program raises the most 
interesting challenge. The measure is admittedly crude; attitude measures are always 
a source of caution and qualification. But in examining the different aspects on which 
the students were queried, it does seem that a teacher who does more personal and ex- 
tended writing with the students will not ensure a growth of a higher regard for the 
virtues of writing. Students working with skill sheets were no less positive about writ- 
ing. It would seem that as the students in the expressive-writing classes did more writ- 
ing and had more demanded of them in writing, they knew enough of its intricacies to 
name what was termed in the interview question “the bad things about writing.” The 
bad things ranged, in the students’ words, from the problem of “messy writing” and 
“scribbling” to “if you don’t know what you are supposed to do.” The good things 
about writing, on the other hand, included the fact that you “make pictures and write 
stories,” “learn about stuff,” and “exercise arms.” That these writers who wrote four 
times as much as the other students could also suffer a decline in their opinion about 
whether writing is a good way to say something would also seem to indicate a certain 
level of frustration with the act. Perhaps some of this frustration is avoided in the 
lockstep, programmed skills approach to writing readiness. 

In the assessment of self-concept, both groups made similar gains in their 
declarations of themselves as writers and as readers. This was equally so with the set 
of cartoons for self-identification, as both groups made a similar move to more active 
figures. However, the clearest gain of one program over another was in the technical 
vocabulary of writing rather than in the students’ expression of their affective or func- 
tional regard for writing. With the elements of a writing vocabulary measured, from 
the identification of words and sentences to naming the components of a story, the 
expressive-writing class achieved a significantly better gain. Still, this was somewhat 
other than what might have been expected. In the skill-sequenced classes, the stu- 
dents’ work showed a focus on the word, for instance, as they copied the words from 
the board or the top of their sheet, as well as on the word’s parts as they identified the 
initial sounds through picture-letter matchups. Yet the expressive-writing students 
achieved a greater gain in their oral identification of a word as a unit, with only a 
slightly weaker show in identifying words in the written format. Understandably, with 
the opportunity to write narratives, this group also showed a significantly greater gain 
in their ability to identify the major punctuation marks. 


Discussion 

This initial study suggests that having students do more extensive and “meaningful” 
writing will not ensure a notably greater expression of a confidence in themselves as 
writers nor a greater enthusiasm about the possibilities of writing, though they may 
show signs of a greater sense of its usefulness in the world at large. The students in 
both classes were not in a position to evaluate their instruction in comparison to how 
adults write, as we would, especially in relation to how the professional author writes. 
If the skill-sequenced students did not find their introduction to writing as stifling as 
Smith (1982) predicts, it may be because they have no romantic notions of writers 
working away in New York studio-lofts or Rocky Mountain cabins with which to contrast 
their own sorry assignments. The skill sheet provides an immediate sense both 
of writing and of a clear-cut competence marked with a sticker, however distant it is 
from having a story accepted by The New Yorker. 

On the other hand, the students who had the opportunity and responsibility to 
sustain their work for a page or more could be expected to know in more detail the dif- 
ficulties or “bad things about writing.” Professional writers, after all, are quick enough 
to tell you about the terror of the blank page which they face time and again: Graves 
(1984), for example, confesses that “all of the writing in these years (1978-1982) was 
very painful to me” (p. 5). Yet he and other expressive-writing advocates are the ones 
who wax enthusiastic about the creative pleasures of these child-writers, and though 
they might expect the children to catch a good part of their enthusiasm for writerly 
writing, the students seem to catch no more of it than is provided in classes where they 
are child-students carefully mastering and climbing the steps to literacy. 

These findings also suggest, again in a qualified manner, that students who write 
more independently at least seem to learn more about the units of writing—the word 
and sentence—which they have worked on so diligently; they seem to know more, too, 
about the function of writing, as they have done more with it. In comparison with such 
gains, the larger questions which expressive writing begs—are the students changed 
by writing? are they creating or finding themselves on the page?—must continue to be 
guided by acts of faith. The exploratory effort to tap changes in self-concept through 
the use of the cartoons suggested that both grade 1 groups had made a slight shift in 
their identifications to more assertive characters, as if getting out into the public 
forum of the school full time might in itself be an uplifting experience. Yet whether 
writing out of oneself is a means of creating a self-concept of greater agency remains to 
be established. 

It will take a number of different designs and repeated measures to ascertain with 
any degree of certainty the impact on students of these different introductions into 
literacy. The question is not simply academic. Teachers are currently confronted by 
advocates of the two contesting paradigms for introducing children into literacy, and 
they deserve to see the respective claims of the two approaches tested and compared. 
Research must continue to be done on the role of self-expression in the development 
of the child, as well as into the question of whether broadening the function of writing 
can have lasting effects on participation in the literacy of the community. Finally, 
there remains the extent to which a skills approach can still lay a foundation of self- 
confidence and competence in literacy, which is to say that both approaches, the skill- 
sequenced and the expressive-writing, continue to deserve careful scrutiny and more 
sophisticated techniques in assessing what students are learning about writing and 
about themselves as they are taken down these different roads to literacy. 


TABLE 3 
Interview Schedule 

Writing Function 
1. What are all the things you like to make stories about? 
2. What can be done with writing? 
3. Who do you know who writes? 
4. Who do you write for (make stories for)? 
5. What does an author or writer do? 

Writing Attitude 
6. What are all the good things about writing? 
7. What are all the bad things about writing? 
8. Is writing fun? 
9. Do you make something with writing? 
10. Is writing things down hard work? 
11. Is writing a good way to say something? 

Self-Concept 
12. Who would you like to be in this picture? [seven pictures] 
13. Are you a writer or a printer? 
14. Are you a reader? 
15. Who is interested in what you write (in your stories)? 
16. Would you like to make books for others to read? 

Technical Vocabulary 
17. Is this a word? [say bicycle, er, friend, b] 
18. How many words is this, one or two? [yellow; kangaroo; my friend; went swimming] 
19. Does this card have one word on it, or more than one? 
20. Can you find three sentences in this story? 
21. Can you show me a period, a question mark, an exclamation mark, and quotation marks? 
22. What things make up a story? 

Note 

1. The author wishes to acknowledge with gratitude the support of the Social Sciences and Humanities Research Council of 
Canada (Grant No. 410-84-0046) in funding this research and to thank Carl Braun and two anonymous reviewers for their 
helpful comments. 


A Comparison of the Reading 
Processes of Adult Illiterates 
and Four Groups of School Age Readers 


The purpose of this study was to describe and compare the reading processing behav- 
ior of adult illiterates and low and regular readers in grades 2 and 4 when reading nar- 
rative and expository text at their instructional level. The observed data were 
categorized according to different cognitive processes and grouped into three clusters: 
code dominant, meaning dominant, and control dominant. Indicators of reading ef- 
fectiveness were also determined and were designated as Cluster 4. 

The results of MANOVAs indicated that while adult illiterates were most like the 
grade 2 readers in terms of their processing behavior, they were more effective than 
these readers in their reading, which may be due to the adults being able to use more 
elaborate prior knowledge schemas in relation to their processing. The results also 
point to the complexity of the cognitive aspects of reading and the inappropriateness 
of focusing on single processing variables. 


Illiteracy is a serious problem among adults in the United States and elsewhere 
(Nickerson, 1985). Efforts to ameliorate this problem are not always successful, par- 
ticularly in light of a drop-out rate of more than 50% and even higher for low achiev- 
ing adult illiterates. One reason for this high rate of attrition is that literacy programs 
are not always meaningful to adult illiterates (Harman, 1984). 

While programs for adult illiterates are frequently based on programs for young 
children, models for describing literacy for children may not be easily transferred to 
adults (Lytle, Marmor, & Penner, 1986). While some aspects of a model describing 
adults and children are likely to be different (especially the uses and effects of liter- 
acy) it is possible that cognitive processing demands may be similar. While much is 
known about the cognitive processing of children, little is known about how adult 
illiterates process print (Arnove & Arboleda, 1977; Kavale & Lindsey, 1977; Lindsey & 
Jarman, 1984). It was the purpose of this study to describe and compare the cognitive 
processing of a group of adult illiterates and four groups of school age children when 
reading narrative and expository text at their instructional level. 


Dr. Fagan currently holds a McCalla Research Professorship Award in the Department of Elementary Education where he is 
pursuing further research related to adult illiteracy. 

Reading is commonly described as an interactive or transactive processing activity 
in which readers interrelate information from the text (visual information) and 
information from past knowledge (nonvisual information) in order to construct 
meaning for the text (Goodman, 1970; Rosenblatt, 1978; Rumelhart, 1977; Smith, 
1982; Stanovich, 1980). The nature of this interaction is rather complex (Rigg & 
Kazemek, 1983) since cognitive processes do not operate independently (Cronin, 
1982), but occur “sometimes in parallel, sometimes sequentially” (Kintsch & van Dijk, 
1978, p. 363). Stanovich (1980) maintains that readers process texts not only in an in- 
teractive manner but also in a compensatory manner, that is, the reader operates 
through a series of checks and balances in allocating cognitive capacity according to 
the various demands in making sense out of text. 

The nature and degree of a reader’s cognitive processing may be influenced by a 
number of factors such as the reader’s affective state, the amount and nature of prior 
knowledge and language competence, and the difficulty and nature or structure of the 
text. The two latter factors were investigated in relation to the cognitive processing of 
the subjects, while an attempt was made to control the others through the selection of 
subjects and materials. 

Research by Reiser and Black (1982) showed that the nature of the text structure 
influenced the manner in which it was processed for meaning. They stated that, whereas 
in reading expository text a reader is concerned mainly with building or expanding 
knowledge structures, in narrative text the reader’s main task is to access an appropriate 
schema within which various knowledge statements may be incorporated and tied 
together in a causal manner. Although researchers such as Mandler and Johnson 
(1977), Meyer (1981), and Rumelhart (1977) have delineated the nature of various 
text structures and have studied the implications of such structures for reading com- 
prehension, they have not compared readers across narrative and expository text. 


Method 
The means employed to achieve the purpose of the study are described below. 
Sample 
A sample of 20 adults was obtained through the assistance of the coordinator of a liter- 
acy project using volunteer tutors for adults whose reading levels ranged between 0 
and grade 9. Adults whose reading levels were about grade 4 level or below were re- 
quested, since it is this group which is largely ignored (Lytle, Marmor, & Penner, 
1986). All adults had been interviewed by the literacy project personnel and assessed 
by the Standard Reading Inventory (McCracken, 1966) with respect to reading 
achievement level. The reading achievement levels of the 20 adults chosen for the 
study ranged from primer to grade 3.5, with a mean of 2.4. There were no IQ scores 
available for the adults, but none were considered mentally handicapped according to 
project personnel. All except three of the adults were being tutored through the pro- 
ject. These three, although interested in receiving help, were not able to do so at that 
time because of personal reasons. The adults represented both lower and middle 
socioeconomic levels and ranged in ages from 21 to 60 years. Approximately one-half 
of the group were unemployed at the time. Two were homemakers and one was a 
retiree. While some saw literacy as a step toward employment, others saw it as a 
means of personal development. 

Twenty low and 20 regular achieving grade 2 students were drawn randomly from 
populations of 32 and 43, respectively, in schools assigned for the study by a large 
urban school system. All 21 low and 22 regular achieving grade 4 students available 
were included in the sample. Low and regular achieving readers were so designated by 
the school principals and teachers, and reading achievement scores on the Gates- 
MacGinitie Reading Test (1980) were then obtained from the school records. The 
mean reading achievement scores of the low and regular achieving grade 2 readers 
were 1.5 and 2.7 respectively, which were significantly different, t(38) = 12.84, 
p < .001. The mean reading scores for the low and regular grade 4 students were 2.8 
and 5.7, also significantly different, t(41) = 12.01, p < .001. All readers were within the 
average IQ range as assessed by the Primary Mental Abilities Test (1965) for the 
grade 2 readers and the Canadian Cognitive Abilities Test (1980) for the grade 4 stu- 
dents. 

The four schools from which the sample was drawn were located in middle and 
lower socioeconomic areas of the city. Like the adult sample, all were native speakers 
of English. More specific information on the sample is given in Table 1. 


Narrative and Expository Texts 

The narrative passages were written by Cronin (1982) using criteria developed from 
the literature on story grammar, lexical difficulty, and sentence complexity. The 
specific criteria ensured that the passages were similar in nature to the passages of 
reading instructional materials and that there was a progression of difficulty over 
grade levels. The passages may be considered to possess ecological and construct 
validity. Passages were written at six levels (grades 1 to 6) and each passage involved a 
story dealing with a particular animal. In a pilot study with five adults (not part of the 
sample), passage 1 was found to be too difficult for some, and the experimenter wrote 
two simpler passages at a primer and preprimer level based on the criteria outlined by 
Cronin. 

The expository passages were taken from the science section of Form B of the 
Individual Evaluation Procedures for Reading (Rakes, Choate, & Waller, 1983). Both 
Forms A and B consist of three sections or sets of passages (literature, science, and 
history) and within each set there are 11 passages ranging from primer level to 
grade 10. Vocabulary and sentences in the passages were patterned on high frequency 
counts from commonly used basal reader series. Field tryouts of all passages were con- 
ducted on 1,171 children. The science passages in Form B were chosen since the topic 
was animals, thus corresponding to the topic of the narrative passages. In order to par- 
allel the narrative passages at the low levels, a passage at the preprimer level was writ- 
ten by the experimenter. 


Instructional Level 

Passages were designated as instructional level if the reader made 10% or fewer 
uncorrected miscues and of these miscues at least 75% resulted in no or minor mean- 
ing change (see Table 10 for definition). Passages that were more difficult according 
to these criteria were designated frustration level, which Silvaroli (1982) describes as 
the level where comprehension is poor and where there is tension and slow, halting 
reading. Since regular achieving readers in grades 2 and 4 would not likely engage in 
reading materials at frustration level, materials at this level were not analyzed. 
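
The two criteria above can be expressed as a small decision rule. This is only a sketch: the function name and arguments are hypothetical, the miscue percentage is assumed to be taken relative to the number of words read, and a passage with no uncorrected miscues is assumed to satisfy the meaning-change criterion trivially.

```python
def passage_level(words_read, uncorrected_miscues, minor_or_no_change):
    """Classify a passage as 'instructional' or 'frustration' for one reader.

    words_read: total words in the passage read aloud
    uncorrected_miscues: miscues the reader did not self-correct
    minor_or_no_change: how many of those miscues caused no or minor meaning change
    """
    if words_read <= 0:
        raise ValueError("words_read must be positive")
    if not 0 <= minor_or_no_change <= uncorrected_miscues:
        raise ValueError("minor_or_no_change must lie between 0 and uncorrected_miscues")

    miscue_rate = uncorrected_miscues / words_read
    if uncorrected_miscues == 0:
        acceptable_share = 1.0  # assumption: nothing to judge, criterion met
    else:
        acceptable_share = minor_or_no_change / uncorrected_miscues

    # Criterion 1: 10% or fewer uncorrected miscues.
    # Criterion 2: at least 75% of those miscues cause no or minor meaning change.
    if miscue_rate <= 0.10 and acceptable_share >= 0.75:
        return "instructional"
    return "frustration"


print(passage_level(100, 8, 6))   # 8% miscues, 75% acceptable
print(passage_level(100, 12, 12))  # too many miscues
```

Both conditions must hold together, so a passage read with few miscues can still be classed as frustration level if most of those miscues change the meaning.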


Procedure 

The researcher and two trained assistants administered the passages, which the 
subjects read orally in a one-to-one situation; immediately afterward, the subjects gave a free 
recall of what they had read. The subjects were told that the passages would become 
increasingly difficult but that it would be appreciated if they would try as much as 
they could. All subjects were most cooperative. The order of administering the narrative 
and expository passages was varied so that if the first subject received a narrative 
set first, then the second subject received the expository set first. Each subject 
received a sample passage from each set. 


TABLE 1 
Information on the Sample 

                              Low 2         Reg. 2        Low 4         Reg. 4      Adult 
Reading achievement range     [illegible]   2.3-3.9       1.9-3.5       4.2-7.2     P-3.5 
Reading achievement mean      1.5           2.7           2.8           5.7         2.4 
IQ range                      96-112        99-118        98-110        95-115      NA 
IQ mean                       109           112           106           110         NA 
C.A. range                    7.2-8.1       [illegible]   [illegible]   9.3-10.0    21-60 
C.A. mean                     7.3           7.4           9.5           9.3         NA 
Female                        6             11            4             11          9 
Male                          14            9             17            11          14 


Data Analysis 

On the basis of the literature reviewed, a number of cognitive processes were identi- 
fied. Because the intent of the study was to describe the processes of adult illiterates 
and compare them to the processes of children while reading, and because cognitive 
processes are not directly observable but must be inferred from ongoing behavior, 
each process was operationally defined in terms of data obtained while the adults read 
orally. Since some processes could not be operationalized on the basis of miscue data 
(for example, inferring), these were excluded. 

The processes chosen for description were: attending, analyzing, symbol-sound 
associating, meaning associating, predicting, synthesizing, and monitoring. Processes 
are not independent of the text data being processed, nor are they independent of 
each other. In order to reflect those interdependencies, the processes were grouped 
into three clusters. Cluster 1, a code dominant cluster, included the processes of at- 
tending, analyzing, and symbol-sound associating, and focused on within-word struc- 
ture. Cluster 2 was termed a meaning dominant cluster and comprised the processes 
of meaning associating, predicting, and synthesizing, and focused on words, phrases, 
and sentences. The third cluster included monitoring variables and was designated 
the control cluster. A fourth cluster was identified which included effective reading 
behaviors: ratio of miscues to words read, nature of meaning change, pronunciation of 
key words (concepts), and recall of key units. 
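
One way to picture the grouping just described is as a simple lookup table from cluster to member variables. The dictionary keys and the `cluster_of` helper are hypothetical names chosen for this sketch; the member lists follow the paragraph above.

```python
# Cluster membership as described in the text; key names are illustrative only.
CLUSTERS = {
    "code_dominant": ["attending", "analyzing", "symbol-sound associating"],
    "meaning_dominant": ["meaning associating", "predicting", "synthesizing"],
    "control": ["monitoring"],
    "effectiveness": [
        "ratio of miscues to words read",
        "nature of meaning change",
        "pronunciation of key words",
        "recall of key units",
    ],
}


def cluster_of(variable):
    """Return the name of the cluster that contains the given process or behavior."""
    for cluster, members in CLUSTERS.items():
        if variable in members:
            return cluster
    raise KeyError(variable)


print(cluster_of("predicting"))
```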

All data were transcribed and assigned to a particular process or behavior within 
each of the four categories. All tabulations were summed across passages at the in- 
structional level and converted to ratios (see Table 10 for definitions and scoring pro- 
cedures). Interrater reliability by two independent raters for assigning data to 
categories was 93.2% agreement. 
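
For readers unfamiliar with the statistic, simple percent agreement between two raters can be computed as in the sketch below. The category labels are invented, and the study does not state whether any chance-corrected index was also used, so this shows only the plain percentage.

```python
def percent_agreement(rater_a, rater_b):
    """Percentage of items that two raters assigned to the same category."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must code the same non-empty set of items")
    matches = sum(a == b for a, b in zip(rater_a, rater_b))
    return 100.0 * matches / len(rater_a)


# Hypothetical codings of four miscues by two independent raters.
first_rater = ["attending", "predicting", "monitoring", "synthesizing"]
second_rater = ["attending", "predicting", "monitoring", "analyzing"]

print(percent_agreement(first_rater, second_rater))
```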

Differences across readers and text type by cluster were tested by a multivariate 
analysis of variance. Where differences occurred, each subvariable within the cluster 
was subjected to a univariate MANOVA. 


TABLE 2 
MANOVA Pairwise Contrasts by Group on Cluster 1 and Subvariables 

Contrast                      df1    df2    F             p< 
Cluster 1 
  Adult - Low 2               10     187    4.60          .001 
        - Reg. 2              10     187    2.77          .01 
        - Low 4               10     187    2.71          .01 
        - Reg. 4              10     187    4.73          .001 
Var. 2 (Mod. attend) 
  Adult - Low 2               1      196    15.01         .001 
        - Reg. 2              1      196    8.31          [illegible] 
        - Low 4               1      196    6.49          [illegible] 
        - Reg. 4              1      196    10.49         .001 
Var. 3 (Low attend) 
  Adult - Low 2               1      196    2.01          NS 
        - Reg. 2              1      196    [illegible]   NS 
        - Low 4               1      196    8.21          .01 
        - Reg. 4              1      196    18.15         .001 
Var. 8 (Final letter cues) 
  Adult - Low 2               1      196    [illegible]   .001 
        - Reg. 2              1      196    .69           NS 
        - Low 4               1      196    2.92          NS 
        - Reg. 4              1      196    3.69          .05 


TABLE 3 
Means and Variances by Groups on Cluster 1 Variables 
for Which There Were Significant Differences 

                          Adult    Low 2    Reg. 2    Low 4    Reg. 4 
Var. 2 (Mod. attend) 
  Mean                    .15      .04      .07       .08      .06 
  Var.                    .02      .00      .01       .01      .02 
Var. 3 (Low attend) 
  Mean                    .18      .20      .20       .32      .38 
  Var.                    .02      .07      .06       .04      .04 
Var. 8 (Final letter) 
  Mean                    .58      .79      .63       .67      .68 
  Var.                    .04      .04      .10       .03      .04 
Results
The analysis yielded the following results. 
Processing Clusters 
The results for Cluster 1 indicate that there were significant main effects for reader
group, F(40, 710)=1.95, p<.001, and text type, F(10, 187)=2.07, p<.05. There were
no significant interaction effects, F(40, 710)=.91. Only at the cluster level was the
nature of the text significant in terms of the processes used. Similar findings for text
occurred for Clusters 2 and 4.

In order to determine which reader group contrasts were significant, Cluster 1 
and each of the subvariables for which there were significant differences across groups 
were subjected to post hoc MANOVAs. The results are shown in Table 2. Adult 
illiterates differed significantly from each of the other groups on Cluster 1 (Code 
Dominant Processing). The three subvariables within this cluster for which there 
were significant differences among groups were moderate and low attending, and an- 
alysis of words by final position. The means are given in Table 3. Adult illiterates en- 
gaged in more moderate attending than all other groups. They were also less likely to 
select final letter cues than were low grade 2 and regular grade 4 readers. 

There were also significant main effects for group, F(24, 667)=3.49, p<.001, and
text type, F(6, 191)=5.19, p<.001, with no significant interaction effects,
F(24, 667)=1.48, for Cluster 2. Table 4 reports the post hoc MANOVA contrasts by
reader groups on Cluster 2 and the four subvariables for which significant differences
were indicated. Means are given in Table 5.

The adult illiterates differed from all other groups on the nature of their Meaning 
Dominant processing. Within this cluster, significant differences occurred for the 
subvariables, synthesizing (sentence and part sentence units) and predicting (from 
preceding phrase). Generally, the adult illiterates engaged less in synthesizing at the 
sentence and part sentence levels when utilizing appropriate semantic and syntactic 
cues; they demonstrated more instances of synthesizing when semantic though not 
syntactic cues were used appropriately. They also engaged in less predicting based on 
the meaning of the phrase prior to the miscue. 

The results for Cluster 3 indicated significant main effects for reader group only, 
F(12, 513) =2.22, p<.01. The data in Table 6 show that the adult illiterates differed 
significantly from all groups except the low grade 2 readers on the Control Processing 
Cluster. 

The one subvariable for which differences across groups were significant was hesi- 
tation. As in the case of the findings for the total Cluster, the adult illiterates and low 
grade 2 readers did not differ on this variable. From the data in Table 7, it can be seen 
that hesitations were quite characteristic of the reading behavior of these two groups. 
A hesitation occurred for almost every miscue compared with the regular grade 4 
group who hesitated only once for every 10 miscues. Thus it seems that the two groups 
of readers with lowest achievement scores were more unsure of themselves as they 
read and tended to engage in considerable overt monitoring when asked to read orally. 


Reading Effectiveness Cluster 
There were significant differences across reader groups, F(24, 667)=4.34, p<.001, and
text type, F(6, 191)=9.81, p<.001, on the overall measure of reading effectiveness.
There were no significant interaction effects, F(24, 667)=1.14.
The data in Table 8 indicate that the adult illiterates differed significantly from 
all other groups on the Reading Effectiveness Cluster. The means are given in Table 9. 
Significant differences occurred between adult illiterates and all other groups on 
the miscue ratio. However, unlike the low grade 2 readers, only very few of these 

TABLE 4

MANOVA Pairwise Contrasts by Group
on Cluster 2 and Subvariables

Contrast           df1   df2   F             p<

Cluster 2
Adult - Low 2      6     191   3.28          .01
      - Reg. 2     6     191   6.68          .001
      - Low 4      6     191   [illegible]   .001
      - Reg. 4     6     191   4.65          .001

Var. 2 (Synthesizing-inappropriate syntactic cues)
Adult - Low 2      1     196   1.48          NS
      - Reg. 2     1     196   5.05          .05
      - Low 4      1     196   5.30          .05
      - Reg. 4     1     196   6.29          .01

Var. 3 (Synthesizing-sentence)
Adult - Low 2      1     196   4.25          .05
      - Reg. 2     1     196   1.67          NS
      - Low 4      1     196   .98           NS
      - Reg. 4     1     196   11.00         .001

Var. 4 (Synthesizing-part sentence)
Adult - Low 2      1     196   2.19          NS
      - Reg. 2     1     196   .68           NS
      - Low 4      1     196   7.38          .01
      - Reg. 4     1     196   4.87          .05

Var. 6 (Predicting-preceding phrase)
Adult - Low 2      1     196   7.61          .01
      - Reg. 2     1     196   1.51          NS
      - Low 4      1     196   4.33          .05
      - Reg. 4     1     196   1.35          NS

TABLE 5

Means and Variances by Groups on Cluster 2 Variables
for Which There Were Significant Differences

Cluster 2      Adult         Low 2       Reg. 2        Low 4         Reg. 4

Var. 2 (Synthesizing-inappropriate syntax)
Mean           .20           .15         [illegible]   [illegible]   .10
Var.           .06           .04         .02           .01           .01

Var. 3 (Synthesizing-sentence)
Mean           .44           [missing]   .36           .48           .60
Var.           .06           [missing]   .05           .03           .04

Var. 4 (Synthesizing-part sentence)
Mean           [illegible]   [missing]   .80           .87           .87
Var.           .04           [missing]   .03           .01           .02

Var. 6 (Predicting-preceding phrase)
Mean           .07           [missing]   .16           .14           .10
Var.           .00           [missing]   .02           .01           .02


TABLE 6

MANOVA Pairwise Contrasts by Group
on Cluster 3 and Subvariables

Contrast           df1         df2   F           p<

Cluster 3
Adult - Low 2      3           194   [missing]   NS
      - Reg. 2     3           194   [missing]   .001
      - Low 4      3           194   [missing]   .001
      - Reg. 4     3           194   [missing]   .001

Var. 2 (Hesitations)
Adult - Low 2      [missing]   196   [missing]   NS
      - Reg. 2     [missing]   196   [missing]   .01
      - Low 4      [missing]   196   [missing]   .001
      - Reg. 4     [missing]   196   [missing]   .001

TABLE 7

Means and Variances by Groups on Cluster 3 Variables
for Which There Were Significant Differences

Cluster 3      Adult         Low 2   Reg. 2        Low 4   Reg. 4

Var. 2 (Hesitations)
Mean           .93           .92     .44           .18     .10
Var.           [illegible]   1.58    [illegible]   .06     [illegible]


TABLE 8

MANOVA Pairwise Contrasts by Group on
Cluster 4 and Subvariables

Contrast           df1   df2   F             p<

Cluster 4
Adult - Low 2      6     191   6.78          .001
      - Reg. 2     6     191   10.59         .001
      - Low 4      6     191   [illegible]   .01
      - Reg. 4     6     191   3.97          .001

Var. 1 (Miscues/words)
Adult - Low 2      1     196   12.06         .001
      - Reg. 2     1     196   45.49         .001
      - Low 4      1     196   10.41         .001
      - Reg. 4     1     196   [illegible]   .001

Var. 4 (Major meaning change)
Adult - Low 2      1     196   10.97         .001
      - Reg. 2     1     196   .02           NS
      - Low 4      1     196   .09           NS
      - Reg. 4     1     196   .00           NS

Var. 5 (Key words accessed)
Adult - Low 2      1     196   3.67          .05
      - Reg. 2     1     196   3.44          NS
      - Low 4      1     196   3.01          NS
      - Reg. 4     1     196   3.24          NS

Var. 6 (Key units recalled)
Adult - Low 2      1     196   8.57          .01
      - Reg. 2     1     196   20.57         .001
      - Low 4      1     196   [illegible]   NS
      - Reg. 4     1     196   1.09          NS

miscues resulted in major meaning change. The adults were also able to correctly
pronounce more words signifying key concepts than were the grade 2 readers. The adults
also differed significantly from the grade 2 reader groups by recalling a greater num- 
ber of key units. While the adults did not differ significantly from the grade 4 readers 
on the number of key units recalled, this does not mean that the adult illiterates were 
recalling as many units as were the grade 4 groups. What it does mean is that relative 
to the number of passages read (i.e., their instructional level), they were recalling as 
many key units as the older school age readers. The adult illiterates usually surpassed 
their reading instruction level on the second and third grade passages (similar to the 
low grade 2 and some regular grade 2 readers) while the regular grade 4 readers, for ex- 
ample, were reading passages quite comfortably up to the grade 6 level. In fact, a num- 
ber of the regular grade 4 readers did not reach frustration level on the passages read. 

The results in Tables 8 and 9 seem to indicate that, unlike the low grade 2 readers
(and in the case of key units recalled, the regular grade 2 readers), the adult illiterates
were more sensitive to key concepts in a passage and to their potency for aiding
recall. Thus the processing behavior of adult illiterates tended to result in more effective
reading than that of the low grade 2 readers, and in the case of recalling key units,
they were more effective than the regular grade 2 readers.


Conclusions and Discussion 

Adult illiterates differed from regular and low reading achievers in grades 2 and 4 in 
terms of the nature of their processing: code dominant, meaning dominant, and con- 
trol. These results also indicate the complexity of the reading process (Rigg & 
Kazemek, 1983), for within each cluster there were varying differences on specific pro- 
cesses for various groups. For example, whereas the adults did not differ from the low 
grade 2 readers in the use of inappropriate syntactic cues when synthesizing, they 
were more successful than these readers in synthesizing when appropriate syntactic 
cues were available; the opposite result occurred for the comparison between the 
adults and the regular grade 2 readers. Thus the processing of adult illiterates or any 
group of readers cannot be described in terms of a single process behavior such as a 
developmental lag in phonemic processes (Beech & Harding, 1984). In fact, the use of 
graphophonic information, at both a letter and syllable level, did not differ signifi- 
cantly across the groups studied in this research. 

In terms of differences on specific processes, the adult illiterates differed least
often from the grade 2 readers (four differences for low grade 2 and three for regular
grade 2) and most often from the grade 4 readers (six differences for low grade 4 and
seven for regular grade 4). Thus while these adults had spent an average of nine years
in school (information obtained from them), the nature of their reading processing
behavior was similar to that of readers who have spent less than two years in school.

However, the processing behavior of adult illiterates tended to result in more ef- 
fective reading behavior than the grade 2 readers, especially in the recall of key units. 
This may be understood in terms of the interactive nature of the reading process 
(Stanovich, 1980) and the presence of appropriate and elaborate knowledge schemas 
with which to construct a meaning for the text (Kazemek, 1983). The adults possessed 
a greater degree of knowledge about the various animals that constituted the topics of 
the passages. They frequently made comments and related personal experiences as 
they attempted to construct meaning. An example of the interaction of one process
(monitoring) with this prior knowledge was provided by one adult as he attempted
to pronounce the word “built” in the sentence “They built her a house in
a tree.” The adult made repeated attempts at “byool,” “bile,” “boo-il” (note inattention 

TABLE 9

Means and Variances by Groups on Cluster 4 Variables
for Which There Were Significant Differences

Cluster 4      Adult         Low 2         Reg. 2   Low 4   Reg. 4

Var. 1 (Miscues/words)
Mean           .07           .05           .03      .04     .04
Var.           .00           .00           .00      .00     .00

Var. 4 (Major meaning change)
Mean           [illegible]   [illegible]   .07      .06     .07
Var.           .00           .04           .00      .00     .00

Var. 5 (Key words accessed)
Mean           .97           .96           .99      .99     .99
Var.           .00           .00           .00      .00     .00

Var. 6 (Key units recalled)
Mean           [illegible]   [illegible]   .28      .44     .50
Var.           .03           .03           .03      .02     .03


to final letter cues), and then said “It has something to do with a house—paint—no it 
can’t be paint, the first letter doesn’t match—buy—no, made—oh, built.” In contrast, 
a low grade 2 reader read a sentence “Bob was able to help him” as “Bob was ablo to 
help him” and in the recall stated that “Bob was ablo.” The adults’ processing coupled 
with other cognitive resources such as prior knowledge allowed them to correctly pro- 
nounce most of the key words and to retrieve over four out of 10 of the key units in a 
free recall situation. One may surmise that if the passages were based on such topics as 
unemployment or abortion, the adults would have an even greater advantage than the 
school age readers in attempting to construct meaning. 

The reading processing behavior of low achieving adult illiterates is relatively 
similar to the processing behavior of readers who have spent two years in school. How- 
ever, the adult illiterates were generally more successful in constructing and recalling 
meaning from the text. Thus, while programs for low achieving adult illiterates may 
incorporate methods to engage the adults in cognitive processes similar to those of 
young school readers, these programs must also take into account other resources of 
adults such as the presence of more elaborate prior knowledge schemas. 


TABLE 10 


Behavioral Indicators of Cognitive Processes and Reading Effectiveness


A chart was made of all uncorrected miscues, corrected miscues, repetitions of words or word 
parts, and hesitations (three-second delay or greater). The operational criteria for the various 
processes were as follows: 


Cluster 1 


1, 2, 3. Attending to text: for substitution miscues with one-half or more graphemes in common, a
code of 2 was assigned; less than one-half but at least 1, a code of 1; and no graphemes in common,
a code of 0. Additions and omissions were coded as 0. A code of 2 was considered indicative of high
attending, a code of 1 as moderate attending, and a code of 0 as low attending. Each total was
divided by the number of miscues.


4. Attending to words: this was a ratio of the number of graphemes in common between text words 
and miscue substitutions. 


5, 6, 7. Analyzing-position: the ratio of cues accessed from the beginning, middle, and end of
words when a response is substituted, to the total number of substitutions. 


8. Analyzing-sequencing: the ratio of substituted responses to the corresponding text words in 
which two or more contiguous letters were accessed in the same sequence. 


9. Symbol-sound associating (graphemes): the ratio of phonemes in substituted responses to com- 
mon phonemes in text words. 


10. Symbol-sound associating (syllables): the ratio of correctly pronounced syllables in 
substituted responses to common syllables in text words. 


Cluster 2 
1. Meaning associating: the ratio of real word responses to the number of substitution and inser- 
tion miscues. 


2. Synthesizing-sentence (syntactic, nonsyntactic): the ratio of miscues that within a sentence 
were semantically acceptable, to the total number of miscues. 


3. Synthesizing-sentence (semantic-syntactic): the ratio of miscues that within a sentence were 
semantically and syntactically acceptable, to the total number of miscues. 


4. Synthesizing-part sentence (semantic-syntactic): the ratio of miscues that within part of a
sentence (phrase) were semantically and syntactically acceptable, to the total number of miscues.


5. Predicting-sentence: the ratio of miscues that were semantically consistent with the meaning 
of the previous sentence, to total number of miscues. 


6. Predicting-phrase: the ratio of miscues that were semantically and syntactically consistent
with the meaning of the preceding phrase only, to the total number of miscues.
Cluster 3 


1. Monitoring (corrected miscues): the ratio of the number of miscues that were corrected to the 
total number of miscues. 


2. Monitoring (hesitations): the ratio of the number of hesitations to total number of miscues. 

3. Monitoring (repetitions): the ratio of the number of repetitions to total number of miscues. 
Cluster 4 

1. Miscues: the ratio of miscues to the number of words in the text. 

2. Major meaning change: the ratio of miscues which changed the meaning of the key concepts to 
the total number of miscues. 

3. Minor meaning change: the ratio of miscues which changed the meaning of a text word other 
than a key concept, to the total number of miscues. 

4. No meaning change: the ratio of miscues that did not change the meaning of a text word, that is, 
they were synonymous with the text meaning. 

5. Key words* correctly pronounced: the ratio of number of key words (representing key con- 
cepts) correctly pronounced, to the total number of key words. 


6. Key units** retrieved: the ratio of the number of key units retrieved, to the total number of key 
units. 


*Key words (concepts) were identified by two independent raters (91.1% agreement) by select- 
ing the approximate one-third most important concepts in each text. The clauses containing 
these concepts were then considered the key units.** 
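
The scoring procedure above (tallies summed across passages, then converted to ratios) can be sketched for the Cluster 3 monitoring indicators and the Cluster 4 miscue ratio. This is only an illustration under stated assumptions: the `PassageTally` record, the function name `indicator_ratios`, and the sample counts are hypothetical and do not come from the study.

```python
from dataclasses import dataclass


@dataclass
class PassageTally:
    """Hypothetical per-passage counts for one reader."""
    words: int        # words in the passage text
    miscues: int      # all miscues made on the passage
    corrected: int    # miscues the reader corrected
    hesitations: int  # pauses of three seconds or more
    repetitions: int  # repeated words or word parts


def indicator_ratios(tallies):
    """Sum tallies across passages, then convert the totals to ratios."""
    words = sum(t.words for t in tallies)
    miscues = sum(t.miscues for t in tallies)
    if words == 0 or miscues == 0:
        raise ValueError("Ratios are undefined without words and miscues.")
    return {
        # Cluster 4, item 1: miscues relative to words in the text
        "miscues_per_word": miscues / words,
        # Cluster 3, items 1-3: monitoring behaviors relative to total miscues
        "corrected": sum(t.corrected for t in tallies) / miscues,
        "hesitations": sum(t.hesitations for t in tallies) / miscues,
        "repetitions": sum(t.repetitions for t in tallies) / miscues,
    }


# Hypothetical reader with two instructional-level passages.
passages = [
    PassageTally(words=100, miscues=6, corrected=2, hesitations=5, repetitions=1),
    PassageTally(words=100, miscues=4, corrected=1, hesitations=4, repetitions=1),
]
ratios = indicator_ratios(passages)
for name, value in ratios.items():
    print(f"{name}: {value:.2f}")
```

Summing before dividing, rather than averaging per-passage ratios, follows the statement in the text that tabulations were summed across passages and then converted to ratios.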
Note
1. The research reported in this article was funded in part by a Support for Advancement of Scholarship research grant from
the Faculty of Education, University of Alberta.
Redressing the Negative Influence
of Socioeconomic Status and Gender
in a Treatment Facility School


The potential of a remedial education program to redress the academic failure stu- 
dents experienced in community schools prior to entering a treatment facility was ex- 
amined. Test scores in five skill areas for 20 clients with an average age of 13.3 years 
and of mixed gender and socioeconomic status (SES) were compared upon entering 
and exiting from the program. All subjects in the study made significant gains in each 
skill area tested. Results indicated that remedial education in conjunction with a 
treatment program was effective in overcoming the influence of SES and gender on 
achievement. 


Numerous studies have addressed the effect of socioeconomic status (SES) and gen- 
der on academic achievement. Coleman (1966) emphasized the importance of family 
background and argued that the most powerful predictor of school performance is 
socioeconomic status. Calliste (1982) contended that the higher SES students were 
likely to have higher academic achievement, a better self-concept, and different em- 
ployment aspirations. Initial impressions which influenced all subsequent interac- 
tions (the halo effect) emphasized by manners, dress, language used, and deportment 
were all strong characteristics favoring students of higher SES. Rist (1970) noted that 
schools were not willing to diagnose the same negative behavioral traits in high status 
students as they were for lower status students. 

Similarly, gender studies have suggested that teacher evaluations favor females
over males. Prawat and Jarvis (1980) found
that female high school students were given higher evaluations than were males by the 
same teachers. A study involving ethnic groups in public schools showed that female 
preference was a factor in assessing academic achievement (Clifton, 1981-1982). 
Studies addressing gender behavioral characteristics revealed that females received 
more teacher interaction as they were viewed in less threatening terms. Males, on the 
other hand, received more discriminatory interactions due to their propensity for 
overt aggressive behavior (Kedar-Voivodas & Tannenbaum, 1979). These authors
believed that aggressive tendencies on the part of males were the most unequivocal sex
difference in the literature. These findings were like those of Brophy and Good (1970)
who observed that discrimination against males was not so much a sexist reaction on 
the part of teachers, but was a reaction against the breaking of classroom rules by 
boys. 

The professional literature also indicates that teacher assessments of students 
based on second-hand information obtained from reports, school reports, and psycho- 
logical assessments can be negative if past histories indicate poor performance 
(Rist, 1973), or positive if they are described in terms of high achievement. High 
achievers experienced more quality interactions than those students considered to be 
poor achievers (Prawat & Jarvis, 1980). 

This study examined the ability of a remedial education program within the con- 
text of a treatment facility to redress such negative aspects of SES and gender on aca- 
demic achievement experienced by each client in the community school system. The 
subjects were a specialized population which had been academically and intellectually 
assessed by school board personnel, and had all been designated as underachievers 
functioning below expected grade level. The group included upper SES females and 
lower SES males and females. The upper range females exhibited the negative effects 
of failure to live up to the higher expectations placed upon them as upper SES fe- 
males. The lower SES group brought with them the most severe negative aspects of 
SES interactions since they were part of the lower strata of an already deprived group. 
Thus the population of the study possessed most of the negative characteristics 
outlined in the SES and gender literature. The focus of this project was to determine 
if such a group could achieve academic success in a treatment environment in spite of 
their past negative school experience. 


Method 

The way in which the project was designed and undertaken is described below. 
Subjects 

The subjects consisted of 20 clients: 11 females ranging in age from 11.2 years of age to 
16.1 years (average 13.7 years) and nine males ranging in age from 10.2 years of age to 
15.4 years (average 12.8 years). The average length of time spent in the program was 
10.4 months. All of the clients were involved in a treatment program at Thistletown 
Regional Centre for Children and Adolescents, Toronto, during the years 1981-1982, 
and all attended an on-site school as part of their therapeutic program. The subjects 
exhibited some or all of the following behavioral characteristics: verbal/physical ag- 
gression against peers and/or adults, delinquency, suicidal gestures, deviant behavior 
at home/school/community, truancy, and school failure. All had been involved with a 
variety of social, legal, and educational authorities prior to admission to the treatment 
program. 

Each client or subject was required to attend school daily and was placed in a 
class with up to four other peers of either gender, and received instruction from one 
teacher. Individual programming was the mode of instruction. If the class consisted of 
unusually aggressive students, a teacher’s aide was assigned. All of the academic 
subjects were taken in the same classroom with the assigned teacher. Students moved 
to other locations for instruction in woodworking, art, and physical education on an 
average of two times a week. 
Testing

Each subject was retested by a diagnostician upon entering the program in order to 
ascertain functioning levels for reading comprehension, mathematics, language, oral 
reading, and spelling. The purpose of the testing was to identify a performance level 
for each client in each of the skill areas so that a remedial plan could be developed by 
the diagnostician in conjunction with the classroom teacher. The remedial education 
plan was incorporated into the treatment goals, and was addressed at each review con- 
ference held every six weeks. Ongoing informal evaluation was conducted by the 
teacher in order to determine each client’s progress. On discharge from the program, 
the student was retested by the diagnostician in order to determine the degree of aca- 
demic progress in each of the skill areas. Due both to the length of time between the 
two formal testing procedures and the fact that a different form of the Metropolitan 
Achievement Test was used, the practice factor was not deemed to have inflated the 
scores. Since the anxiety level of clients increased as discharge time approached, the 
final scores may have been slightly depressed. 


Instruments 

The Metropolitan Achievement Test was used to determine pre- and postgrade level 
scores for reading comprehension, mathematics, and language. Another measure of 
reading achievement was obtained using the Slosson Oral Reading Test. Spelling
performance was assessed by the Morrison-McCall Spelling Scale. The Blishen
socioeconomic index (Blishen & McRoberts, 1976) was used to determine the
socioeconomic prestige ranking for the working parent of each client.


Procedure 

The subjects were divided into five different groups for comparative purposes: all 
males and all females; upper range SES clients and lower range SES clients; upper 
range females; lower range females; and lower range males. There were no upper range 
males available for the study. The distinction between upper and lower range SES 
groups was determined by adopting the median point between the highest and the 
lowest rating of the working parental figure who had the higher index rating according 
to the Blishen scale. 
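
The grouping rule described above, a cutoff at the midpoint between the highest and lowest parental index ratings, can be sketched as follows. The client labels and index values are hypothetical, not actual Blishen scores from the study, and assigning a rating that falls exactly on the cutoff to the upper group is an assumption the source does not address.

```python
def split_by_midpoint(ratings):
    """Split clients into upper and lower SES groups at the midpoint of the rating range.

    `ratings` maps each client to the higher of the parental index ratings.
    """
    if not ratings:
        raise ValueError("At least one rating is required.")
    values = ratings.values()
    cutoff = (max(values) + min(values)) / 2
    # Assumption: a rating equal to the cutoff is placed in the upper group.
    upper = sorted(client for client, r in ratings.items() if r >= cutoff)
    lower = sorted(client for client, r in ratings.items() if r < cutoff)
    return cutoff, upper, lower


# Hypothetical index ratings for four clients.
ratings = {"client_a": 30.0, "client_b": 42.5, "client_c": 58.0, "client_d": 70.0}
cutoff, upper, lower = split_by_midpoint(ratings)
print(f"cutoff={cutoff}, upper={upper}, lower={lower}")
```

A midpoint-of-range cutoff differs from a median split of the sample: it depends only on the two extreme ratings, so the resulting groups need not be equal in size.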

An SES package was used to generate comparison tables for the progress of each 
client using the differences between pretest and posttest scores for reading compre- 
hension (RC), mathematics (M), language (L), oral reading (OR), and spelling (S). 
T-tests were used to evaluate the degree of progress made by each student in each of 
the skill areas and ANOVA tables compared the groups on the basis of gender and SES
in order to determine if significant differences were obtained by comparing one group 
with another. 
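
A correlated (paired) t-test on pretest and posttest scores can be sketched with the standard library alone. The grade-level scores below are hypothetical and are not the study's data; the function name `paired_t` is likewise introduced only for illustration, and the sketch stops at the t statistic and its degrees of freedom rather than computing a p value.

```python
import math
import statistics


def paired_t(pre, post):
    """Return the paired-samples t statistic and its degrees of freedom."""
    if len(pre) != len(post):
        raise ValueError("Each subject needs both a pretest and a posttest score.")
    if len(pre) < 2:
        raise ValueError("At least two subjects are required.")
    diffs = [after - before for before, after in zip(pre, post)]
    sd = statistics.stdev(diffs)  # sample standard deviation of the gains
    if sd == 0:
        raise ValueError("t is undefined when all gains are identical.")
    standard_error = sd / math.sqrt(len(diffs))
    return statistics.mean(diffs) / standard_error, len(diffs) - 1


# Hypothetical grade-level scores for five students in one skill area.
pre = [3.0, 4.0, 2.5, 5.0, 3.5]
post = [4.0, 5.5, 3.0, 6.5, 4.5]

t_value, df = paired_t(pre, post)
print(f"t({df}) = {t_value:.2f}")
```

The statistic would then be compared with a critical value of the t distribution for the given degrees of freedom to judge significance.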


Results 

The correlated t-test results indicated that all of the students in the study made
significant gains in each of the skill areas tested. All of the t-test values were significant
at the .01 level. When all males were compared to all females in terms of achievement, 
female students made significant gains in mathematics (t=2.70, p<.0223) and in 
reading comprehension (t=2.70, p<.0223), and less significant gains in spelling, oral
reading, and language (see Figure 1). Males made significant gains in reading compre- 
hension (t=5.41, p<.006) and oral reading (t=3.34, p<.0132), and less significant 
gains in language and mathematics (see Figure 2). 

Figure 1: Females (N=11), Pre- and Posttest Score Comparisons
[Figure omitted: pre-test and post-test mean scores]

Figure 2: Males (N=9), Pre- and Posttest Score Comparisons
[Figure omitted: pre-test and post-test mean scores, in grades, by skill (RC, OR, L, M, S)]

The SES results reveal that the lower group made significant gains in reading
comprehension (t=4.17, p<.0008), oral reading (t=4, p<.0016), and spelling
(t=3.35, p<.0074; see Figure 3). The upper SES students made significant gains in
mathematics (t=2.99, p<.0203) and moderate gains in all other areas with the
exception of oral reading (see Figure 4).

Group comparisons on the basis of performance for SES and gender suggest that 
although the lower range of females improved in their performance, there was no sig- 
nificant difference between their performance and the upper range SES group. When 
the results achieved by all males were compared with all females, the latter scored 
higher in mathematics (F(2.17) =7.08, p<.0165). 


Discussion 

The results of this study suggest that SES and gender were not influential factors af- 
fecting academic performance of clients in a treatment facility school. This is a 
marked departure from research findings that show SES and gender as variables af- 
fecting academic achievement in the community school system. Whereas much of the 
literature suggests that higher SES students receive higher grades than lower SES 
students, the findings from this investigation reveal that in a specialized setting the 
upper SES group did not receive a more positive evaluation than the lower SES stu- 
dents as there was no significant difference between the scores of the two groups. 
Similarly, the view that underachieving students whose school records present a his- 
tory of negative behavior receive little positive teacher interaction, appears to be con- 
tradicted by the finding that lower range SES clients improved in more of the skill 
areas than did the upper range SES group. Likewise, the claims that males experience 
more discriminatory interactions and achieve fewer academic gains than females and 
that males resort more frequently to aggressive and rule-breaking conduct than fe- 
males and thereby are subjected to negative teacher discrimination, are challenged by 
the lack of difference in the achievement of females and males observed in this study. 

The findings and context of this investigation tend to support Brophy’s (1983) as- 
sertion that teachers can relate well to students given an accurate initial assessment of 
their potential and when the learners are open to “corrective feedback.” Teachers in 
the facility environment approached their task with fewer expectations of their stu- 
dents to perform; thus the emphasis upon their immediate learning needs may have 
helped to overcome the effects of SES and gender. The smaller classes provided a less 
stressful environment, along with greater individual attention thereby building 
student confidence and facilitating more immediate feedback on performance. 

There are a number of other factors which may explain why the effects of gender 
and SES were less apparent in this study. The teachers and clinicians in the project 
tended to be more empathetic toward the students because of their awareness of the 
effect negative gender and SES attitudes had on clients when they were attending 
community schools. This desire to improve the students’ negative self-images became 
an important ingredient of the treatment plan. It was reflected in the collaborative
approach of teacher and clinician in support of the goals of both education and 
treatment. In many cases, educational demands placed upon the students were initi- 
ally subordinated to treatment needs, and the client was introduced to educational 
expectations on a gradual basis which proved to be less stressful than those previously 
experienced in the community school system. When educational demands were made, 
it was in a small supportive classroom setting where expectations for performance 
were less threatening and less competitive. 

Figure 3: Lower Range SES (N=14)

Figure 4: Upper Range SES (N=7)

Another influential factor was the low teacher-pupil ratio, which allowed for
individual instruction. This enabled the teacher to address more effectively the goals of
the educational plan. The fact that progress towards these goals was reviewed every 
six weeks reinforced their motivation. The treatment setting also afforded more con- 
trol over the educational process. When students were removed from the classroom 
back to the residence because of unacceptable behavior, their work was still required 
to be completed. Thus gaps in instruction and learning were minimized, and failure to 
follow through could result in sanctions being imposed on the client until the assign- 
ments were completed. 

The outcomes of this study of the Thistletown Project indicate that it is possible 
to redress or overcome the negative influence of SES and gender on student achieve- 
ment. The extent to which the gains observed will be sustained, diminished, or pro- 
moted in other learning environments remains to be investigated. 
BOOK REVIEW


THE INCOMPETENT TEACHER, By Edwin M. Bridges. Philadelphia, PA: 
Falmer, Taylor and Francis, 1986, 191 pp. 


This book will interest administrators, school board members, teachers, and others re- 
sponsible for dealing with incompetent teachers. It describes the successes and fail- 
ures of administrators who attempt to eliminate poor teachers from the classroom. It 
also presents a plan that might eventually rid the teaching profession of its undesir- 
able members. 

I found this book to be a pleasure to read. Bridges makes good on his promise to 
write a book that is informative, interesting, easy to read, lacking in technical jargon, 
and without complicated statistical analyses. An average reader should have little dif- 
ficulty in completing the reading of this publication in two evenings. Each of the six 
chapters is nicely summarized for the reader who is pressed for time or is interested in 
reviewing only key findings. 

Aside from an extensive review of the literature, the data used by Bridges to com- 
pose this book were acquired from interviews with administrators and from question- 
naires received from administrators of randomly selected school districts. A third 
source of information was a case study of a small district which was considered suc- 
cessful in dealing with teacher incompetence. Since all data were collected in the state 
of California, some might argue that the findings in this book are not applicable to 
other regions. It is difficult to take this criticism too seriously because teacher incom- 
petence is known to transcend political boundaries, and, also, the legislation which 
protects tenured teachers is quite uniform across North America. 

The author estimates that roughly five percent of teachers are incompetent. He 
found that the term “incompetence” lacks precise technical meaning and neither 
legislatures nor courts of law have provided guidelines that assist administrators in 
determining poor teaching performance. When asked to give their opinions, adminis- 
trators stated that incompetent teachers usually failed to: maintain discipline, treat 
students properly, impart subject matter effectively, accept teaching advice from 
superiors, master subject matter, and produce the intended or desired results in the 
classroom. The research conducted by Bridges revealed that administrators did not 
often deal effectively with teachers evidencing these behaviors. 

The second chapter describes how and why poor teachers are tolerated and pro- 
tected. It comes as little consolation to learn that doctors, lawyers, and members of 
other professions are equally skillful at covering up for their inept colleagues. Three 
reasons are given for why administrators are reluctant to confront incompetent teach- 
ers. First, they are aware that the employment rights of tenured teachers are pro- 
tected by legislation and the burden of proof as to whether a teacher is incompetent 
rests with the school district. A teacher possessing a permanent contract enjoys a 
property right to that position and is entitled to due process of the law if termination 
is being considered. The considerable expense associated with documenting a case 
against an incompetent teacher, coupled with the possibility that a school district may
lose the judgment, causes some administrators to overlook poor performers. 

The ambiguous nature of teacher evaluation is cited as a second reason why ad- 
ministrators do not purge incompetents from school classrooms. There are no 
clear-cut standards to guide administrators in determining teaching incompetence. 

Third, administrators often choose to avoid the discomfort and unpleasantness 
which accompanies the dismissal of an incompetent teacher. In place of direct action, 
the administrator may use a roundabout approach such as assigning the teacher to a 
less than desirable working environment. 

Ineffective administrative practices are seen as contributing to the professional 
longevity of incompetent teachers. When some administrators make their infrequent 
classroom visits, the occasions are used to offer ceremonial congratulations. Seldom 
do these administrators address the improvement of instruction. Teachers are often 
provided with inflated ratings and their reports contain double talk in that positive 
statements are used to describe poor performance. 

Several factors are identified which force administrators to deal with incompe- 
tent teachers. Parental complaints about teachers, when numerous and in writing, 
often spur administrators into action. Most complaints about teachers originate with 
middle-class parents. Lower-class parents seldom voice their concerns, while disen- 
chanted wealthy parents generally transfer their children to private schools. Teacher 
layoffs due to declining enrollments or fiscal cutbacks sometimes induce administra- 
tors to use teaching performance rather than seniority as a basis for eliminating posi- 
tions. Public concern is often voiced when young capable teachers are laid off and 
long-term, incompetent teachers are retained. Smaller districts with limited finances 
are more likely to use performance than seniority as the basis for layoffs. Larger and 
more affluent districts tend to accommodate staff cutbacks through attrition and fre- 
quently hide their incompetent teachers through transfers to positions which are less 
likely to draw the ire of parents. In those districts where teacher evaluation is taken 
seriously, administrators are inclined to take action against incompetent teachers. 

When a lion-hearted administrator decides to confront an incompetent teacher, a 
series of events generally follows. Chapter 3 summarizes the salvage attempts used by 
administrators to rehabilitate poor teachers. Initially, an administrator provides a 
teacher with a detailed description of what is happening in the classroom. In a written 
report the evaluator is often critical of the teacher’s performance. The report also 
specifies what the teacher must do to become competent. Expected tasks and behav- 
iors are spelled out and a lesson-planning model is frequently used to direct the teach- 
er. Upon receiving the report, the teacher often rejects the criticism, becomes 
defensive and antagonistic, and may even launch a counterattack against the adminis- 
trator. The poor performer is offered opportunities to improve by observing excep- 
tional teachers, attending inservice sessions, and conferring with consultants. Because 
of the possibility of future legal action against the teacher, the administrator seldom 
offers any form of positive reinforcement which might weaken the district’s case at a 
dismissal hearing. Although the evaluation reports are written for the benefit of the 
teacher, they are also intended to satisfy the adjudicator who might later uphold or 
deny the teacher’s dismissal. 

Bridges’ research revealed that most attempts to salvage incompetent teachers 
fail. This prophecy could be self-fulfilling considering the lack of praise or encourage- 
ment offered by administrators to teachers during evaluations. But even staff devel- 
opment specialists who spend considerable time and effort in working with poor 
teachers report that they are disillusioned by their attempts to remake substandard 
teachers. The findings in this book are gloomy in that the performance level of most
poor teachers seldom improves to an acceptable level in spite of the remedial pro- 
grams that are offered. This leads one to believe that providing assistance to incompe- 
tent teachers is a waste of valuable resources and is merely posturing while the 
administrator builds a case for dismissing the poor performer. 

The dismissal of incompetent teachers is one option which is available to 
administrators, but it is not uncommon for an administrator to consider other ways of
removing poor teachers. Administrators often exert pressure on marginal teachers to 
resign or retire early. Through indirect means, an administrator may change a 
teacher’s working conditions, such that it becomes undesirable for that person to remain
in teaching. If direct methods are used, the administrator may issue negative evaluations,
use threats and intimidation, place the teacher on a remediation program, or
serve notice of intent to dismiss. 

Most induced exits entail negotiations between the teacher who is under fire and 
an administrator. The approach taken by the administrator in the negotiations ap- 
pears to depend on his or her personality. Some administrators attempt to address the 
needs of the teacher and uphold his or her dignity, while other administrators are 
hardliners who employ intimidation tactics. Most teachers receive some type of in- 
ducement to resign but their settlements are usually less than the long-term savings 
realized by the district. 

Bridges found that teachers’ unions play a unique role in the dismissal of incom- 
petent teachers. Although unions owe their members the right to fair representation 
and often provide this service, their officers do not render the same level of service to 
everyone. In cases where documentation revealed the teacher to be clearly incompe- 
tent, the union cooperated with administrators in counseling, advising, or persuading 
the teacher to resign. 

A case study is presented to illustrate that a school district can systematically rid 
itself of poor teachers. The district under study had three elementary and junior high 
schools and served an isolated, suburban community which was relatively affluent. 
The drama unfolded when a number of younger but popular teachers were threatened 
with dismissal due to staff cutbacks. A citizens’ committee pressured the superinten- 
dent and school board into using teaching performance rather than seniority as a basis 
for reducing staff. In response to this pressure, and after receiving a considerable do- 
nation of money from the private sector, the superintendent and school board initi- 
ated a plan to improve teaching performance and remove incompetent teachers. 

The evaluation of teachers in this district was performed by principals and was 
based on management by objectives. Administrators spent considerable time observ- 
ing teachers in their classrooms and surveying students and parents. When teaching 
deficiencies were identified, teachers were provided with additional supplies, equip- 
ment, advice, and development opportunities to help them improve. Those teachers 
who did not measure up to the standards set by the district were advised to resign or 
take early retirement. Consulting firms were employed to counsel teachers into alter- 
nate careers and to prepare principals for the emotional confrontations associated 
with teacher dismissals. 

The plan used by this district to deal with incompetent teachers appears to have 
worked quite well. After four years of implementation, the district had cut nine teach- 
ing positions and the teachers who left were the weakest. The union proved helpful to 
the administration by not filing any grievances. 

The author concludes his book by reviewing present practices of recruiting, devel- 
oping, and evaluating teachers. He is critical of the status quo because insufficient 
attention is given to promoting effective teaching. The undesirable by-product of
present practice is an assured supply of gainfully employed incompetent teachers. In 
musing about what might occur in the future, the author proposes a course of action 
that may reduce the number of poor teachers occupying the nation’s classrooms. The 
attainment of his objective is linked almost exclusively to the recruitment of prom- 
ising graduates from teacher education programs and the subsequent development of 
these teachers in their pretenure years. For reasons which are not immediately appar- 
ent, Bridges chooses to ignore the continued professional development of the tenured 
teacher. 

The selection procedures proposed by Bridges for recruiting teachers are rigorous 
and based on performance criteria. Once hired, probationary teachers face tough ten- 
ure provisions. To achieve the high standards set by districts, new teachers are pro- 
vided on a voluntary basis with extensive assistance to help them develop. To use the 
author’s words, the system of evaluation is “hard on the standards but soft on the 
people.” He predicts that in spite of rather stringent recruitment practices, only a mi- 
nority of probationary teachers will be granted tenure. This forecast is disturbing 
when one considers the human and fiscal costs associated with training and develop- 
ing beginning teachers. One might argue that the author’s pessimistic prediction is 
alterable and that the ratio of probationary teachers gaining tenure can be increased 
significantly if beginning teachers are required to participate in developmental activ- 
ities. The voluntary system of professional development which is advocated by 
Bridges, although democratic, may weaken the probationary teacher’s chances of 
success. How many other professions use a similar hit-and-miss approach? 

This book is well written, informative, and does a superb job of describing how 
administrators deal with incompetent teachers. In designing a future that is devoid of 
teacher incompetence, the author provides some useful ideas for selecting teachers 
and planning for their development. His proposal is incomplete in that it fails to men- 
tion how school districts can deal effectively with tenured teachers whose competence 
is on a downhill slide. Perhaps this shortcoming will be addressed in a later publica- 
tion. Reading this book is a must for administrators, school board members, teachers, 
and others who wish to address seriously the problem of teacher incompetence. 


Russ Wiebe 
Lethbridge Regional Office 
Alberta Department of Education 
The Alberta Journal of Educational Research Vol. XXXIII, No. 3, September 1987, 149-149


THE EDITOR’S PAGE 


Toward a Science of Emerging Order 


It is currently fashionable for researchers to be open and sensitive to emerging themes 
both during and after data collection, particularly in studies in which the researcher 
becomes immersed in the setting. Although some major insights have emerged from 
such studies, the hope that new themes will emerge is often very little more than just 
that: a hope. Perhaps we need to develop a “science of emerging themes” as an aspect 
of methodology in the human sciences. Such a science could help us not only to have
more confidence that themes will emerge, but also to have more confidence in the themes
themselves. More importantly, such a science may also help us in articulating the slip- 
pery distinction between themes that “emerge” and themes that are “imposed” by the 
researcher. 

What might such a science look like? Clearly there is no single perspective from
which to construct such a science. However, from my perspective, the emergence of themes
in human research is not an isolated phenomenon unique to human science, but rather is
a phenomenon pervasive across all science. Seen in this way, the search for a science of 
emerging themes can draw on the wider experience of scientists everywhere, some of 
whom may prefer to call themselves philosophers, artists, historians, or mathemati- 
cians. The label is not the significant indicator. The significant indicator is that they 
are all interested in the wider phenomenon of emerging order. 

How then might we approach a science of emerging order? In attempting to an- 
swer this question I can do little more than ask other questions. How do we know 
when order has emerged? How do things become ordered? Where does order come 
from? Is there a beginning to order? Does order come from order, from chaos, or from 
nothing? Does order begin with time, or does time begin with order? Can there be 
order without repetition? Poets, physicists, anthropologists, theologians, and others 
have all asked similar questions. I am in particular reminded of the work on the Laws 
of Form done in logic; of the relations between order and chaos done in mathematics; 
of the Superstring Theory of everything, including a theory of time before the Big 
Bang; of the Theory of Dissipative Structures describing how order emerges from 
chaos; of spontaneous social order emerging in autopoietic systems. Unfortunately, I 
am unable to cite any major work in education: Rare is the educator who asks these 
questions. It is as if education is taken to be an “applied” science or art; a domain of 
activity that takes from base disciplines but has no original base of its own. Not to ask 
about the origins of order is to assume that education is indeed a derivative phenome- 
non. To ask about order is to assert that education is a phenomenon of its own with its 
own need to pursue the basic question: How does order emerge in the domain of edu- 
cation? 


Daiyo Sawada 
Faculty Advisory Committee 
HUGH LYTTON
SUSAN ROGERS MAUNULA 


and 


DENISE WATTS 


University of Calgary 


Moral Judgments and 
Reported Moral Actions: 
A Tenuous Relationship 


Relationships among different measures of moral judgment, and between moral judgment
and reports of moral behavior, were investigated in a normal population by means
of two related studies. In Study I, the results of an objective moral reasoning test 
(MAT) were related to teachers’ and parents’ reports of the moral behavior of 9-year- 
old twin boys. In Study II relationships among scores on MAT, Kohlberg’s Moral 
Dilemmas (MDI), one of Damon’s tests, and reported delinquency-proneness were ex- 
amined in 10-year-old boys. Once verbal ability was partialed out, MAT no longer pre- 
dicted moral behavior in Study I, and neither MAT nor the free-response tests 
predicted delinquency-proneness in Study II. Moral reasoning tests were, however, 
fairly consistently related to verbal ability. 


The main theme of this article is the relationship between the behavioral, emotional, 
and judgmental aspects of morality. It thus offers an assessment of the validity of cog- 
nitive developmental theory as it applies to the moral behavior domain. A second 
theme is an examination of the relations among different methods of assessing moral 
judgment. 

Kohlberg’s well-known Moral Dilemma Interview (MDI) deals with the aspect of 
moral judgment or reasoning by means of an open-ended, free-response method which 
classifies subjects’ reasoning about the moral dilemmas posed to them into six stages 
of moral development. Various criticisms have been leveled at the test, for example, 
lack of evidence for the sequentiality of the stages (Kurtines & Greif, 1974), low test- 


Dr. Lytton is a professor of educational psychology with research interests in early social development, parent-child interaction 
and behavior genetics. 


Ms. Rogers Maunula and Ms. Watts are both school psychologists with the Calgary Board of Education. 
retest reliability (Rubin & Trotter, 1977), or vagueness of the scoring rules (Kurtines
& Greif, 1974). The latter criticism has been muted by the recent development of 
more tightly formulated scoring rules (Colby, Gibbs, Kohlberg, Speicher-Dubin, & 
Candee, 1979). With the support also of some countervailing evidence, provided for 
instance by a 20-year longitudinal study (Colby, Kohlberg, Gibbs, & Lieberman,
1983), the test continues in high vogue. 

Damon (1977) also developed a free-response method of assessing moral reason- 
ing, but his stories and stage structure differ from Kohlberg’s in that they focus on 
“fair distribution and sharing of valued resources” which he considers the central con- 
cern of morality. He also felt a need for more child-oriented stories for work with 
younger children. 

Because of certain problems with the free-response interview method, various 
objective-format moral reasoning tests based on Kohlberg’s dilemmas have been developed,
for instance, Rest’s Defining Issues Test (DIT; Rest, 1979). Carroll (1974) developed
a multiple-choice format moral reasoning test, to be described below, as an
adaptation of the DIT suitable for younger children. The advantages of the objective 
format, according to Rest (1979), are that assessment should be less affected by a 
subject’s verbal expressiveness, and that the test and its administration are standard- 
ized as much as possible. 

Another approach to assessing the internalization of moral values, adopted by 
Hoffman (1971), is to test the child’s feelings of guilt regarding the misdemeanors of a 
projective story character with whom the child is expected to identify. Such guilt rep- 
resents the emotional side of morality, or moral affect, and its intensity, following 
Hoffman (1971), is an indication of the child’s internalization of moral rules. 

Our main topic—the relation of moral judgment to action—has been one of the 
persisting themes and concerns in this area. Blasi (1980) firmly believes that whereas 
action should be the final criterion of moral development, moral cognition, too, plays a 
central role in moral functioning. Kohlberg (1978) considers, on the basis of his own 
evidence, that judgment and action are related, and takes the position that mature 
moral judgment is a necessary but not sufficient condition for mature moral action. 
He suggests that factors such as the situation and its pressures and the individual’s 
“ego strength” may affect a person’s actions over and above his level of moral matur- 
ity. Jurkovic (1980), in considering juvenile delinquents, concludes from a review of 
the evidence that, overall, there is a relationship between moral behavior (or its con- 
verse) and level of moral reasoning, and he attributes heterogeneity of moral reason- 
ing within delinquent samples to personality variables. Mischel and Mischel (1976), 
on the other hand, expect poor prediction from attitudinal measures on the general 
ground of the specificity, or discriminativeness, of behavior. 

In reviewing the literature on delinquency and moral reasoning, Blasi (1980) con- 
cluded that the evidence on the whole showed that moral judgment and action are re- 
lated, with the qualification that the evidence is stronger in some areas than in others. 
He found that in all three studies that employed objective tests of moral reasoning 
(Rest, 1970; Carroll, 1974) as criteria, no differences between delinquents and 
nondelinquents were discovered, whereas nine out of 11 studies that used Kohlberg’s 
MDI yielded positive findings. McColgan (1976, cited by Jurkovic, 1980) concluded, 
on the other hand, that the DIT had greater power in discriminating delinquents from 
nondelinquents than the MDI. Kohlberg (1978) found teachers’ reports of children’s 
moral behavior or conscience to be related to the MDI, but Santrock (1975) noted no 
such relationship. While the majority of findings are positive, the question is clearly 
not settled. 
We pursued this question in two related investigations. In Study I the relationships
among moral judgment, moral affect (fantasy guilt), and reported moral behavior
were examined in the context of a broader investigation covering many different
aspects of children’s development (Lytton, Watts, & Dunn, 1986). In light of our expe- 
rience in Study I, the field work of which was completed in 1980, Study II (Rogers, 
1981) was designed specifically for the purpose of investigating the relationships 
among a variety of different measures of moral judgment, and between them and a re- 
port measure of antisocial tendencies regarded as the converse of moral behavior (cf. 
Blasi, 1980). The present set of studies adds to the relatively sparse literature relating 
moral reasoning to action in a normal population. 


Study I 

Study I was part of a larger investigation that examined the social and cognitive devel- 
opment of twins longitudinally from age 2 to age 9. One of the facets of this develop- 
ment is the degree of moral internalization that has taken place. Our purpose was to 
determine this and relate it to the more rudimentary form of internalized standards 
assessed at age 2. (Lytton, 1980, presents an account of the investigation at age 2.) 
Both moral internalization and overt compliance were assessed by reports of behav- 
ior. But it was hypothesized that level of moral reasoning and fantasy guilt would also 
be valid indicators of moral internalization. Two instruments, the Guilt Index 
(Hoffman, 1971) and the Moral Advice Test (Carroll, 1974), were used for this pur- 
pose. 


Method 
Sample 
Thirty-five male twin pairs, both identical and fraternal (out of 46 pairs studied at
age 2) could be traced and agreed to participate in the complete follow-up procedure. 
Their mean age at the time of testing was 9 years 6 months, and the range 8 years 5 
months to 10 years 8 months. The children came from both working class (two-thirds) 
and middle class (one-third) families. 

The inclusion of twins in the larger investigation was relevant to the goals of that 
study, which included genetic factors, but has no special bearing on the studies of 
moral development presented here. Only boys were included in the earlier investiga- 
tion in order to eliminate sex differences as a possible confounding factor in the study 
of parent-child interaction. Because the sample was thus restricted to males, the find- 
ings on moral development should not automatically be generalized to females. 


Procedure 
Measures of moral judgment and attitudes 
(a) The Guilt Index (GI) was a semiprojective story completion task, based on the 
work of Hoffman (1971) and of Allinsmith (1960). One of the authors (Denise Watts) 
wrote an additional item to include, as heroes, a pair of twins. The Guilt Index con- 
tained four stories involving moral decisions related to lying, cheating, stealing, and 
failing to assist a person in distress. Each story beginning was read to the child in indi- 
vidual sessions and the subject was asked to finish the story by describing how the 
hero would feel, and so on. The child’s responses were recorded on tapes which were 
later transcribed in full. 

In scoring the responses we adopted the scoring guide provided by Hoffman (personal
communication, 1979). A score on a 3-point scale was assigned according to the
maximum level of guilt shown, ranging from severe to no guilt, and the scores for the
four stories were summed.
Interrater reliability was established on 12 cases which were scored independently
by two raters. The interrater reliability coefficient over the 12 subjects was .88. 
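
The Guilt Index scoring and the interrater check described above can be illustrated with a minimal sketch. Several details are assumptions made only for illustration: the numeric coding of the 3-point scale (0 to 2), the label of the middle category ("some"), and the use of a Pearson correlation as the reliability coefficient, since the article does not state which coefficient was computed. The function names are hypothetical.

```python
import math

# Assumed numeric coding of the 3-point scale; the article gives only the
# end points ("severe" to "no guilt"), so the middle label is hypothetical.
GUILT_SCORES = {"none": 0, "some": 1, "severe": 2}


def guilt_index(story_levels):
    """Sum the maximum guilt level shown in each of the four stories."""
    if len(story_levels) != 4:
        raise ValueError("the Guilt Index is based on exactly four stories")
    return sum(GUILT_SCORES[level] for level in story_levels)


def pearson_r(x, y):
    """Pearson correlation between two raters' totals for the same children."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


# Invented responses for one child, for illustration only.
total = guilt_index(["severe", "none", "some", "severe"])
```

In use, each rater's totals for the 12 doubly scored cases would be passed to `pearson_r` as two lists of equal length.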

(b) The second moral development measure, the Moral Advice Test (MAT), was a 
test of internalized moral judgment. The test was adapted from the work of Carroll 
(1974), with some minor changes in order to allow for better comprehension by our 
younger subjects. Carroll’s test is a modification of Rest’s (1976) Defining Issues Test. 
It presents the subject with four Kohlberg-type moral dilemmas, each followed by 10 
“advice” statements which give reasons for or against a certain course of action relat- 
ing to the story. Each of Kohlberg’s five stages of moral thought (his Stage 6 being 
viewed as an extension of Stage 5) is represented by two statements. For example, for 
the story concerning the publishing of a student newspaper protesting against some 
school rules in the face of the principal’s prohibition, the Stage 3 statement runs: “You 
feel badly about the principal’s problems, but you will feel much worse about letting 
your friends down. As long as you have your friends’ support you should publish.” The 
child is asked to rate this, and all other statements from “very good” to “very bad.” 

Carroll and Rest (1981) report Hoyt internal consistency coefficients that were 
higher at the lower stages (ranging from .74 at Stage 1 to .66 at Stage 3) than at the 
more advanced stages. While the test has face validity, the only evidence Carroll and 
Rest provide justifying the use of the test as a test of moral reasoning is (a) that in 
their study it correlated with achievement test scores (about .40) and (b) that rejec- 
tion of lower stage reasoning showed a developmental age trend, that is, it increased 
with age. The test was chosen because it was one of the few individually administered 
tests of moral judgment which could be given to children as young as those in our 
study. The rating of statements furnished by the examiner was thought to be easier 
for young children than producing reasons themselves, something which Kohlberg’s 
procedure requires. 

The stories and the advice statements were read out to the children individually 
and the rating for each advice statement was recorded. Carroll and Rest suggest that 
rejections of lower-stage reasons would furnish an assessment of children’s moral de- 
velopment. Following Carroll, therefore, originally a total rejection score for lower 
stage moral thought was calculated by summing the rejection scores for Stages 1 to 3. 
Because the analysis based on this total rejection score generated results that were in 
a direction opposite to theoretical expectations, we instead used “acceptance ratings” 
for Stages 4 and 5 as the summary statistics (i.e., “very good” was scored 4 on a 4-point 
scale). We thus assessed moral development by the acceptance of the more advanced 
levels of moral thought rather than by the rejection of lower levels. This analysis is 
presented below. 
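
As a rough sketch of the acceptance-rating summary described above, the following assumes that the four rating labels map to 4 through 1 (the article states only that "very good" was scored 4 on a 4-point scale), that the intermediate labels are "good" and "bad", and that the summary statistic is a simple sum over the relevant statements. These choices, and the function name, are hypothetical.

```python
# Assumed label-to-score mapping; only "very good" = 4 is given in the article.
RATING_SCORES = {"very good": 4, "good": 3, "bad": 2, "very bad": 1}


def acceptance_rating(ratings, stages=(4, 5)):
    """Sum the acceptance scores of statements belonging to the given stages.

    `ratings` is a list of (stage, label) pairs, one per advice statement.
    """
    return sum(RATING_SCORES[label] for stage, label in ratings if stage in stages)


# Invented ratings for a handful of statements, for illustration only.
example = [
    (1, "very bad"),
    (3, "good"),
    (4, "very good"),
    (4, "good"),
    (5, "bad"),
    (5, "very good"),
]
stage_4_and_5 = acceptance_rating(example)           # Stages 4 and 5 combined
stage_5_only = acceptance_rating(example, stages=(5,))
```

Statements from Stages 1 to 3 are ignored here, which mirrors the shift from rejection of lower-stage reasoning to acceptance of higher-stage reasoning.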

Measures of moral behavior 

(a) The class teachers were given a 5-point rating scale which, among other character- 
istics, asked for ratings of the children’s overt compliance and conscience, each de- 
fined in behavioral terms. “Compliance” was defined as overt compliance with 
teachers’ directives and school rules. “Conscience” was defined as standards that the 
child had internalized in himself or herself and was measured by questions such as: 
“When he has been caught doing something wrong, how often does the child accept 
blame?” 

(b) Mothers and fathers were given interviews which probed their perceptions of 
their children’s characteristics, as well as their own child-rearing practices, with ques- 
tions paralleling those used in the interview at age 2. Ratings on a 5-point scale for 
overt compliance with parents’ requests and internalization of parental standards 
(“Conscience”) were given by the interviewer, based on specific questions relating to
these facets of the children’s behavior. Questions regarding compliance included: 
“When you ask X to do something for you, does he do it right away, or do you have to 
ask him several times?” Questions regarding conscience included: “Does X usually 
obey the rules you have laid down without having to be reminded, or does he have to 
be reminded?” Two investigators independently rated six interview schedules. The 
interrater reliability coefficient for compliance was .94, for conscience .71. 

(c) Teachers and parents were given the teachers’ and parents’ form, respectively, 
of the Children’s Behaviour Questionnaire (CBQ, Rutter, 1967; Rutter, Tizard, & 
Whitmore, 1970) which is designed as a screening instrument for children’s 
maladaptive behavior. This measure, in addition to a total maladjustment score, 
yields a subscore for antisocial behavior. Rutter, Tizard, and Whitmore (1970) report 
satisfactory reliability and validity for these scales. 

Intelligence and achievement measures. Intelligence and achievement were 
assessed by the published measures shown in Table 1. 


Results 

Individual twins’ scores are not independent of one another; for instance, they have 
the same values for demographic variables (e.g., sex, age, socioeconomic status [SES])
on which children may be selected. Moreover, ratings of both twins are based on
reports by mother (or father), that is, are derived from the same person, and the rat- 
ings of mothers’ (and fathers’) own characteristics, for example, warmth to each twin, 
are ratings of the same person. Correlations between twins for most characteristics 
were substantial in this study, reaching, for instance, .85 for antisocial tendencies 
among MZ pairs. To enter each twin’s score separately into an analysis, therefore, 
would often amount to entering the same score twice. This leaves one with the option 
of either randomly sampling from the twin pairs, or of using the mean value of each 
twin pair as the unit of analysis. We have opted for the second approach for the reason 
that the twin pair mean is a more reliable measure of the twins’ characteristics than is 
a randomly sampled member of the pair. Kraemer and Jacklin (1979) also consider 
that when one is dealing with nonindependent dyads, the use of pair means as the unit 
of analysis is a valid method, whereas entering both scores separately is not. 
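
The choice of pair means as the unit of analysis can be illustrated with a minimal sketch. The scores below are hypothetical and the helper names (`pair_means`, `pearson_r`) are introduced only for illustration; they are not taken from the study's own analysis.

```python
from statistics import mean

def pair_means(twin_a, twin_b):
    """Average the two co-twins' scores so each pair contributes one value."""
    return [(a + b) / 2 for a, b in zip(twin_a, twin_b)]

def pearson_r(x, y):
    """Product-moment correlation between two equally long score lists."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5

# Hypothetical scores for four twin pairs (not data from the study).
guilt_a = [10, 14, 9, 16]
guilt_b = [12, 14, 11, 18]
antisocial_a = [6, 3, 7, 2]
antisocial_b = [8, 3, 5, 2]

# One observation per pair, so n equals the number of pairs, not of children.
guilt_pair = pair_means(guilt_a, guilt_b)
antisocial_pair = pair_means(antisocial_a, antisocial_b)
r = pearson_r(guilt_pair, antisocial_pair)
```

With this approach the sample size entering each significance test is the number of pairs, which avoids counting nonindependent co-twins as separate cases.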

Table 1 shows the results for Study I. The correlations of the acceptance ratings 
(AR) are in the expected direction. Moral judgment is associated significantly (by in- 
dependent tests) with SES, IQ, and achievement, and it predicts antisocial conduct 
negatively, and compliance and conscience positively. The correlations of the ARs of 
Stage 5 and of Stages 4 and 5 combined (not shown) are very similar. 

The significance levels are based on individual significance tests. Given a fairly 
large number of tests, this is not strictly legitimate and these results must therefore be 
considered as tentative only. By the more rigorous standards of simultaneous confi- 
dence testing (Larzelere & Mulaik, 1977) the alpha level for accepting significance of 
individual correlations should be .003 to yield an experiment-wise alpha of .10 for 32 
tests together. Under this procedure no result achieves significance. 
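
The per-test alpha of .003 quoted above is what a simple Bonferroni division gives for 32 tests. The sketch below shows only that arithmetic; it assumes a plain Bonferroni split and does not reproduce the full procedure of Larzelere and Mulaik (1977).

```python
def per_test_alpha(experimentwise_alpha, n_tests):
    """Bonferroni split: divide the experiment-wise alpha equally over the tests."""
    return experimentwise_alpha / n_tests

alpha = per_test_alpha(0.10, 32)  # 0.003125, which rounds to .003

# Chance of at least one false positive if the 32 tests were independent
# and each were run at this per-test alpha; it stays below .10.
familywise = 1 - (1 - alpha) ** 32
```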

For the GI only the correlation with the antisocial score (mother’s rating) is signif- 
icant by independent test, but not under simultaneous confidence testing. All other 
correlations, apart from that with age, are near zero, though generally in the expected 
direction. The GI’s correlation with the MAT score is also near zero, thus indicating
that the two tests measure different aspects of moral development: the affective
versus the cognitive.

TABLE 1


Correlations of Guilt Index and Moral Advice 
Test with Other Child Measures 
Twin Pair Means (n: 35) 


Guilt Index Moral Advice 


AR4 
Guilt Index — = 206 
Age Doe OR 
SES 03 ape 

Intelligence and Achievement Measures 
Verbal IQ (Crichton Vocab. Test®) 03 gles 
Nonverbal IQ (Raven’s Matrices®) = 30) 30" 
Total IQ =—103 iA4** 
Total PIAT? —.08 38* 
Failed Grade — 1 se if 
Moral Behavior Reports 

Antisocial Score (CBQ—Teacher) = U6 Ales 
Compliance (TR) 14 10) 
Conscience (TR) 15 wail 
Antisocial Score (CBQ—Mother) ao =] 
Compliance (MR) 2 al 
Conscience (MR) .20 ee 
Antisocial score (CBQ—Father) wal = 336 
Compliance (FR) — Xml Aste) 
Conscience (FR) ale 24 


Note: TR: Teacher rating; MR: Mother report; FR: Father report; CBQ: Child Behavior 
Questionnaire; AR4: Acceptance rating, Stage 4. 
a Raven, Court, & Raven (1977);
b Peabody Individual Achievement Tests (Dunn & Markwardt, 1979);
c n for fathers’ report: 28.
*p<.05, **p<.01—2-tailed test. 


Because the MAT correlated .43 with Verbal IQ it was of interest to see whether 
this relationship accounted for the correlations between moral reasoning and moral 
behavior that we discovered. When Verbal IQ was partialed out from these latter cor- 
relations, we found, indeed, that no correlation between MAT and the behavioral 
measures remained significant, even by independent tests. Verbal IQ, however, did 
not account for the individually significant correlation between GI and Antisocial 
score (mother’s report), because Verbal IQ correlated only .03 with GI. 
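
Partialing a third variable out of a correlation follows the standard first-order partial correlation formula, sketched below. Only the values .43 (MAT with Verbal IQ) and .03 (GI with Verbal IQ) come from the text; every other coefficient in the example is hypothetical and chosen for illustration.

```python
from math import sqrt

def partial_r(r_xy, r_xz, r_yz):
    """Correlation of x and y with z held constant (first-order partial)."""
    return (r_xy - r_xz * r_yz) / sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))

# MAT (x) with a behavior measure (y), Verbal IQ (z) partialed out.
# r_xz = .43 is from the text; r_xy and r_yz are hypothetical.
mat_partial = partial_r(r_xy=-0.40, r_xz=0.43, r_yz=-0.50)

# GI (x) with the same kind of measure: r_xz = .03 is from the text,
# the other two values are hypothetical.
gi_partial = partial_r(r_xy=-0.35, r_xz=0.03, r_yz=-0.30)
```

In this illustration the MAT coefficient shrinks noticeably once Verbal IQ is removed, whereas the GI coefficient barely moves, because a control variable that correlates only .03 with the predictor has almost nothing to remove.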

Internalization of standards assessed behaviorally at age 2 (cf. Lytton, 1980) did 
not predict either the MAT or the GI, that is, it did not appear to be the precursor of 
cognitive moral development. 


Study II 
In view of the counterintuitive results arising from the rejection ratings of the “MAT” 
in Study I, Study II was undertaken first to see how the MAT related to Kohlberg’s 
MDI and to Damon’s Positive Justice Reasoning and what their underlying structure 
was, and secondly to ascertain how well each of these tests predicted a self-report 
measure of delinquent tendencies, taken as the converse of moral behavior. A 
different measure of moral behavior from Study I was selected in order to see whether
the MAT would perform similarly or differently with a new measure. A brief descrip- 
tion of the method follows; further details can be found in Rogers (1981). 


Method 
Sample 
The sample was recruited from all Grade 4 boys in two elementary schools in a lower 
socioeconomic area, in order to match the sample of Study I in sex and, roughly, in 
age. Full data were available for 47 boys with a mean age of 10 years 1 month, ranging 
from 8 years 3 months to 11 years 6 months. 


Procedures 

Intelligence measure 

The subjects’ recent percentile scores on the Canadian Lorge-Thorndike Intelligence 
Tests, verbal and nonverbal forms, were obtained from school board files. This test 
has been standardized on a Canadian population (Lorge, Thorndike, Hagan, & 
Wright, 1967) and its psychometric properties are satisfactory. 


Measures of moral judgment 

(a) Moral Dilemma Interview (MDI) (Kohlberg, 1978). This well-known test was 
administered to the children individually at school. Two stories—“Ben (Heinz) and 
the Drug” and “Joe Saves up for Camp”—were read out to the children. Questions and 
answers were tape-recorded and later transcribed. 

The experimenter (Susan Rogers Maunula) was trained in scoring the responses 
by an associate of Kohlberg’s, a worker at the Center for Moral Education and Devel- 
opment at Harvard University. After discussing five protocols and scoring them under 
his supervision, the experimenter scored the remaining protocols independently, 
using the latest scoring manual (Colby et al., 1979). 

The Standard Scoring method was employed in which individuals’ judgments are 
matched with structurally defined criterion judgments from the manual. A global 
stage score was calculated and, by assigning points for responses at each stage, a 
Moral Maturity Score (MMS) was also arrived at. Because the global stage score cor- 
related .97 with the MMS, only the latter was entered into the analyses. 

For the purpose of checking reliability, 12 protocols, not including the five on 
which agreement had been reached by discussion, were also rated independently by 
the Kohlberg research worker. The resulting reliability coefficients can be seen in 
Table 2. Further psychometric data on the MDI have been reported by Colby et al. 
(1983). 

(b) Positive Justice Reasoning Interview (Damon, 1977). The method consists of a 
presentation of stories which focus on children in school- or play-related situations. 
Subjects are individually asked what the characters should do in fairness to everyone 
concerned. Of the areas of the child’s thinking that Damon focuses on, “positive jus- 
tice reasoning” is perhaps most closely related to the development of moral attitudes 
as conceived by Damon, Kohlberg, and others. We therefore concentrated on this 
realm but, owing to time restrictions, only the story concerning the sharing of the pro- 
ceeds from the sale of pictures made by children was employed. This was read to the 
children individually following on the MDI, and the session was tape-recorded and 
later transcribed. Responses were scored in accordance with the manual (Damon, 
1975) and a “major stage score” assigned for each subject. 

For the purpose of establishing reliability, 20 protocols were randomly chosen and 
independently scored by the experimenter and a trained colleague. See Table 2 for the
reliability coefficients. Further data on reliability and internal consistency for these
interview measures can be found in Damon (1977).


TABLE 2


Interrater Reliability Results


Test                                     n    Correlation    Percentage Reasonable      Percentage Perfect
                                              Coefficient    Agreement                  Agreement

MDI: MMS                                 12   0.61           58% (within 12 points)     17%
MDI: Global Score                        12   [illegible]    83% (within 1/3 stage)     58%
“Positive justice reasoning interview”   20   0.85           100% (within 1 substage)   70%


Note. MDI: Moral Dilemma Interview; MMS: Moral Maturity Score.
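
The two agreement percentages reported for the interview measures can be computed as in the following minimal sketch. The stage scores are hypothetical, and the function name and tolerance parameter are assumptions introduced for illustration rather than the scoring manuals' own procedure.

```python
def agreement_rates(rater_a, rater_b, tolerance):
    """Return (percent perfect agreement, percent agreement within tolerance)."""
    n = len(rater_a)
    perfect = sum(1 for a, b in zip(rater_a, rater_b) if a == b)
    reasonable = sum(1 for a, b in zip(rater_a, rater_b) if abs(a - b) <= tolerance)
    return 100 * perfect / n, 100 * reasonable / n

# Hypothetical stage scores given by two raters to five protocols.
rater_a = [2, 2, 3, 1, 2]
rater_b = [2, 3, 3, 1, 4]

# Three protocols match exactly; a fourth differs by one step.
perfect_pct, reasonable_pct = agreement_rates(rater_a, rater_b, tolerance=1)
```

Perfect agreement can never exceed reasonable agreement, since every exact match also falls within the tolerance.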


(c) Moral Advice Test (MAT) (Carroll, 1974). This test was described under Study I. 
In Study II the test was administered to small groups of subjects. The dilemmas with 
their corresponding advice statements were read to the children who also had the 
typed material in front of them. They then marked their choices in the test booklet. 

Measure of moral behavior. In this study moral behavior was measured by its con- 
verse, namely delinquent or antisocial tendencies, as assessed by the subject’s concur- 
rence with statements in a self-report measure, the Jesness Inventory (Jesness, 1966), 
suitable for this age group. The Jesness Inventory is divided into subscales, and each 
statement relating to the individual’s behavior or attitude is assigned to one or more 
of these subscales. Examples are: “I seem to blow up a lot over little things that really 
don’t matter very much” (Manifest Aggression). “Police usually treat you dirty” 
(Alienation). “A lot of people say bad things about me behind my back” (Social 
Maladjustment). 

Jesness (1966) reports that his Inventory correctly identified 84% of his subject 
population composed of 963 delinquent and 925 nondelinquent male juveniles rang- 
ing in age from 8 to 19. From a survey of other studies, plus their own results, 
Saunders and Davies (1976) conclude that “there is now good evidence for suggesting 
that the social maladjustment, value orientation, alienation, and manifest aggres- 
sion scales are related to ‘deviant personality” (p. 35). These scales representing 
predelinquent attitudes were therefore administered in this study, a total of 104 ques- 
tions. 

Corrected odd-even reliability coefficients for individual subscales are reported by 
Jesness (1966). These range from the low .60s to the high .80s. Test-retest stability co- 
efficients (over an eight-month period) range from .40 to the high .70s. 
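
A corrected odd-even coefficient is usually obtained by correlating the odd-item and even-item half scores and stepping the result up with the Spearman-Brown formula. The sketch below assumes that this is the correction meant here; the item responses are hypothetical.

```python
from statistics import mean

def pearson_r(x, y):
    """Product-moment correlation between two equally long score lists."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5

def spearman_brown(r_half):
    """Step a half-test correlation up to the reliability of the full test."""
    return 2 * r_half / (1 + r_half)

def odd_even_reliability(responses):
    """Corrected odd-even reliability; each row holds one subject's item scores."""
    odd = [sum(row[0::2]) for row in responses]   # items 1, 3, 5, ...
    even = [sum(row[1::2]) for row in responses]  # items 2, 4, 6, ...
    return spearman_brown(pearson_r(odd, even))

# Hypothetical true/false responses (1 = endorsed) of four subjects to four items.
responses = [
    [1, 1, 1, 1],
    [1, 0, 1, 1],
    [0, 0, 1, 0],
    [0, 0, 0, 0],
]
reliability = odd_even_reliability(responses)
```

The correction is needed because each half contains only half the items, so the raw half-test correlation understates the reliability of the full scale.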

The test was administered as a group test following on the MAT. The children 
were told that the test consists of statements about the opinions and attitudes of 
young people. The statements were read out to the subjects and they indicated on the 
test form for each statement whether it applied to them. 
Results
In order to clarify the underlying structure and interrelationships of the different 
types of measures, a principal factor analysis with Varimax rotation was run. The sim- 
ple correlations can be seen in Table 3 and the factor matrix in Table 4. 

The rotated factor solution satisfies Thurstone’s criteria of “simple structure” ex- 
traordinarily well. Each factor represents one type of test: Factor 1—the Jesness 
scales; Factor 2—the multiple-choice Moral Advice Test; Factor 3—the open-ended 
moral judgment tests that require verbal production (Damon, 1975, 1977; Kohlberg, 
1978, 1980); Factor 4—the intelligence tests. Some subsidiary loadings should be 
noted. Thus, Jesness’ Manifest Aggression has a moderate negative loading on the in- 
telligence factor; Jesness’ Alienation has a very modest negative, and verbal intelli- 
gence has a good positive loading on the moral judgment factor (Factor 3). The four 
factors together account for a very high proportion (72.6%) of the total variance. 
However, the hypothesized negative relationship between antisocial tendencies and 
moral development materialized only in a very attenuated form—the loading of 
Alienation on the open-ended moral judgment factor. 
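
The communalities and summed squared loadings that accompany a rotated factor matrix follow directly from the loadings themselves. The sketch below uses a small hypothetical two-factor loading matrix, not the values of Table 4, to show how the row and column sums of squares are formed.

```python
# Hypothetical rotated loadings: variable -> [Factor 1, Factor 2].
loadings = {
    "scale_a": [0.80, 0.10],
    "scale_b": [0.70, -0.20],
    "test_c": [0.10, 0.90],
}
n_factors = 2

# Communality: share of a variable's variance explained by all factors together.
communalities = {
    name: sum(value ** 2 for value in row) for name, row in loadings.items()
}

# Summed squared loadings: variance attributable to each factor after rotation.
summed_squares = [
    sum(row[j] ** 2 for row in loadings.values()) for j in range(n_factors)
]

# Each factor's share of the common variance, in percent.
common_variance = sum(summed_squares)
percent_common = [100 * s / common_variance for s in summed_squares]
```

Because an orthogonal rotation such as Varimax only redistributes variance among factors, the communalities and their total are the same before and after rotation.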

When verbal intelligence was partialed out, the correlations between Alienation 
and the MDI and Damon’s test were reduced further. 


Discussion 
Let us first discuss the structure and interrelationships of the different tests of moral 
reasoning. 

Damon’s tests and Kohlberg’s MDI were closely related, showing very high 
loadings on the same factor in Study II. These are the free-response tests requiring
spontaneous verbal production of reasons, and it appears they correspond to one cog- 
nitive function in the child. Verbal ability, it will be remembered, also loaded on this 
moral judgment factor, and one possibility is that what binds the two tests together is 
their common verbal procedure, that is, the factor may be a methods factor. 

However, it seems their common variance is due to more than method or verbal 
ability. Further analysis revealed that the modal stage of responding for the MDI was 
Kohlberg’s Stage 2, and for Damon’s test it was his Stage 1A. The definitions of these 
respective stages, as it turns out, are very similar, both including the principle of equal 
sharing and exchange (cf. Colby et al., 1979; Damon, 1977). Hence it is clear that both 
tests elicit similar reasoning and thought structure from children at the modal stage. 
This suggests that, despite their diverse story content, the two tests tap similar kinds 
of moral thinking over and above verbal ability. 

Nevertheless, there is a fairly consistent relationship between verbal ability and 
expressed moral reasoning, both in its objective-format form—Study I (the associa- 
tions became nonsignificant in Study I) and in its free-response form—Study II. 

A limitation of the study was that it used only one of Damon’s stories. While the 
relationship between Damon’s test and the MDI that we found is encouraging, it is 
conceivable that, because of this restriction, other possible associations of Damon’s 
test may have gone unnoticed. In view of parallel findings for the MDI, however, this 
is unlikely. 

Can objective tests be taken as equivalent to free-response tests in measuring 
moral reasoning? For the DIT, the main objective moral reasoning test, Rest reports 
correlations of about .40 with the MDI from several studies, using homogeneous age 
groups (Rest, 1979). Froming and McColgan (1979), however, found that once age, 
SES and intelligence were partialed out, the correlation between the tests dropped to 
a nonsignificant .16. Nor did the MAT relate to the free-response tests of either moral 
TABLE 3


Correlations Among Measures Used 


JESM JESVO JESAL JESMA AR4 AR5 DAM KMMS IVERB INONV 
JESVO 74 
JESAL 67 18 
JESMA 06 13 50 
AR4 .06 .24 17 05 
AR5 —.05 = AO} —.04 = 12 61 
DAM —.14 == ,ihil allyl —.04 08 15 
KMMS —.05 — .04 = 24 —.08 == .07 20 42 
IVERB = Pr) =.29 = 30 — .30 =.31 — 01 27 34 
INONV 28 — .36 = 25 — .46 5) = 08 07 5 
AGE 17 — 08 13 408) 05 13 00 — .04 eer Seat) 


Note: r=0.29, significant at p<.05, for n=47
JESM: Jesness Social Maladjustment Scale; JESVO: Jesness Value Orientation Scale; JESAL: Jesness Alienation Scale; 
JESMA: Jesness Manifest Aggression Scale; AR4, AR5: Moral Advice Test—Acceptance Ratings, Stages 4 and 5; DAM: 
Damon’s Positive Justice Reasoning; KMMS: Kohlberg Moral Maturity Score; IVERB: Verbal Intelligence; INONV: 
Nonverbal Intelligence. 


reasoning or guilt in this investigation. Because the two types of test have little 
variance in common, they must elicit different thinking functions in the moral realm 
on which different children rank differently. 

It is clear that the free-response and the objective-format tests differ in that the 
MDI presents a production task, the DIT (and MAT) a recognition task. Because the 
latter task is cognitively easier, children achieve higher stage reasoning on the objec- 
tive test, as found by Rubin and Trotter (1977). Our investigation produced a similar 
finding, because Stage 4 was the most accepted stage on the MAT in both Studies I 
and II, whereas Stage 2 was the modal stage for the MDI. 

The main question of this investigation was the relation between moral thought 
(or feeling) and moral behavior. Of all the tests we employed, only the acceptance rat- 
ings of the MAT generated individually significant associations with several measures 
of moral behavior in one study, but they were not significant under simultaneous con- 
fidence testing, and these associations were not replicated in the second study. Fur- 
thermore, verbal ability accounted for the “significant” associations in Study I. Once 
this is taken out, moral reasoning and behavior appear to be independent areas of 
functioning. This conclusion was confirmed by Study II, both for the objective-format 
and the free-response tests, except for a weak negative relation between feelings of 
alienation and moral judgment. 

Indeed, verbal intelligence was a better (negative) predictor of antisocial tenden- 
cies than the moral judgment tests in Study II. It should be noted that both Kohlberg 
and Damon used their tests with 10-year-olds, and hence it does not seem unreason- 
able for others to employ them for this age group. In view of the demonstrated validity 
of the Jesness Inventory, it is not likely that the absence of prediction can be attrib- 
uted to any inadequacy of this test as an index of delinquent tendencies. 

The Guilt Index produced one association (significant only under independent 
testing), namely with antisocial conduct as rated by mother. Moral affect, therefore, 
seems to be somewhat predictive of moral behavior. Intensity of feelings about 
(imaginary) transgressions may well inhibit a person from committing such 
transgressions, because it is an expression of “conscience” as conceived popularly and 
in psychoanalytic thought. 

Blasi (1980) and Jurkovic (1980) both conclude that the majority of studies they 
reviewed showed heterogeneous delinquents to evince less mature moral reasoning 
TABLE 4


Varimax Rotated Factor Matrix 




Variable Factor 1 Factor 2 Factor 3 Factor 4 Communality 


JESM .76 O01 eee) cet 61 
JESVO .98 09 0m = ai4 98 
JESAL 19 a = .30 01 ihe 
JESMA .68 == 12 205: Seonte! 62 
AR4 sel .76 =.04 =114 .62 
AR5 On .80 wy) 201 .69 
DAM Ale! 10 ray .03 29 
KMMS 05 03 .68 04 46 
IVERB —2\) = 2) aay .60 wh 
INONV =e) =e 04 .78 .68 
AGE O1 16 = 15 10) .06 
Summed square factor ais 1.36 Pe LAT 

loadings after rotation 

Percent of estimated 42.6 ait dl 18.1 18.1 


common variance 


Note: For variable names see Table 3. 


levels than their nondelinquent counterparts. An inspection of the studies they list, 
however, reveals that if one confines oneself to published studies that contrast delin- 
quents with nondelinquents, the majority is very thin. These investigations compared 
the moral reasoning levels of matched groups, the delinquent groups having an ex- 
treme character, in that they consisted of persons who had been legally declared 
offenders. Our study differs from these, because we sought an association between de- 
grees of moral reasoning and a continuum of delinquency-proneness in a normal pop- 
ulation and because our sample was younger, and these differences may in part 
explain the divergent findings. 

Some possible reasons mentioned by Kohlberg (1978) for discrepancies between 
moral reasoning and moral behavior are referred to in the introduction to this paper. 
Such considerations can be summed up by Rest’s statement “moral judgment is an
important factor in real life decision-making, but ... the interaction with other factors
... complicates the relationship so that simple, linear correlations cannot be
expected” (Rest, 1979, p. 260). Nevertheless, “moral” reasoning must derive its validity
as a “moral” indicator from some relationship to what is generally recognized as 
“moral” behavior. This can be demanded particularly when the measure of moral be- 
havior is based on many situations, as in ratings or self-reports. The absence of such a 
relationship between thought and deed must weaken the claims to validity and
usefulness of the moral judgment model and of cognitive developmental theory in general
insofar as it applies to the moral domain.
Findings such as ours also have implications for moral education. Moral education
has often been carried on by means of discussions of moral dilemmas, such as 
Kohlberg posed. Kohlberg himself in the 70s became dissatisfied with the discussion 
approach as a means of moral education (Kohlberg, 1980). A critical ingredient of 
moral education, he came to believe, was social participation. Hence his later at- 
tempts at moral education have emphasized self-governance in small schools, where 
students have to consider the moral implications of realistic school issues and practice 
moral behavior within a group that attempts to be a “just community” (Power & 
Reimer, 1978). 

Discussions of moral dilemmas in the Kohlberg tradition emphasize structure (i.e., 
the reasoning behind moral decision rather than the decision itself). But it may be im- 
portant for moral education to deal not only with the structure of moral judgments, 
but also with their content, that is, the specific situations and areas of behavior that 
generate moral dilemmas (Locke, 1983). Moreover, because the relationships of moral 
reasoning to behavior are uncertain, it may be more effective to inculcate moral values 
and try to enhance other factors that affect moral behavior, for example, ego strength 
and motivation. 
The Etiology of Administrator Burnout


Both quantitative and qualitative research methods were used in an attempt to explore
more fully the nature of burnout among 128 school-based administrators in a large 
western Canadian urban school district. Administrators in general recorded lower than 
normal levels of burnout, although some 34% recorded a higher than normal level of 
Personal Accomplishment burnout. Major predictors of burnout were Overall Work 
Stress, Satisfaction with Status and Recognition, and Satisfaction with Interpersonal 
Relationships. The qualitative data were supportive of these findings, with administra- 
tors indicating that work load, interpersonal relationships, and lack of sufficient posi- 
tive feedback were major contributors to feelings of burnout. These findings point to the 
importance of developing coping skills among administrators who experience work 
stress. Further, the findings indicate that the cultivation of human relations skills and 
the adequate provision of rewards and recognition are important in dealing with burn- 
out among school-based administrators. 


Interest in burnout as it affects both individuals and organizations has shown excep- 
tional growth in recent years. This interest, however, cuts a broad swath, and there- 
fore lacks uniformity in its conceptualization of burnout and in the methodologies 
used to measure the syndrome. Numerous studies, such as those of Anderson and
Iwanicki (1984) and Farber (1984), are primarily quantitative in nature. Only a few
studies, such as those by Blase (1982, 1986), have attempted a qualitative approach in
examining burnout, particularly among educators. Even fewer studies have at- 
tempted to triangulate research (Jick, 1979) in an exploration of burnout. In addition 
to this narrow methodological approach, many studies of burnout in education have 
focused on teachers’ experiences of the syndrome. There are indications, however, 
that researchers are becoming more aware of the unique pressures and demands on 
school-based administrators which may lead to stress and burnout (Gmelch & Swent, 
1984; Kottkamp & Travlos, 1986; MacPherson, 1985; Ratsoy, Sarros, & Aidoo-Taylor,
1986). The study reported in this article was undertaken in response to some of these 
methodological and conceptual inadequacies in the data base on administrator burn- 
out. 


Purposes 
The purposes of this study were to describe the nature of burnout among school-based 
administrators, to examine the extent to which selected job factors predicted burnout, 
and to describe the work conditions administrators identified as major contributors to 
feelings of burnout. 

To fulfill these objectives three questions were examined: (a) What is the nature of 
burnout among school-based administrators? (b) To what extent do overall work 
stress, job satisfaction, role clarity, and job challenge predict burnout among adminis- 
trators? (c) Which work conditions do administrators identify as contributing most to 
feelings and attitudes associated with each subscale of burnout as defined by Maslach 
and Jackson (1981a), namely Emotional Exhaustion, Depersonalization, and Personal
Accomplishment burnout? 


Background to the Study 

Various conceptualizations and definitions of burnout exist (Carrol & White, 1982; 
Cherniss, 1980a; Freudenberger, 1974, 1975; Gmelch, 1983). However, a widely-used 
conceptualization of the syndrome is provided by Maslach and Jackson (1981a). Gen- 
erally, burnout has been linked to the psychological, physiological, and behavioral 
responses to chronic work stress experienced by helping service professionals such as 
social workers, counselors, teachers, and police officers. Maslach and Jackson (1981a, 
1981b) identified the three subscales of Emotional Exhaustion, Depersonalization, 
and Personal Accomplishment burnout as together constituting the burnout 
syndrome. They also developed the Maslach Burnout Inventory (MBI) to measure 
varying degrees of burnout on each of the three subscales. 

Emotional Exhaustion burnout refers to feelings of physical and emotional 
overextension and exhaustion caused by daily work pressures. Maslach (1982) defined 
this type of burnout as “a pattern of emotional overload... [where people] lack energy 
to face another day” (p. 3). Depersonalization burnout refers to the development of 
negative attitudes, such as cynicism and negativism, in which one’s clients and co- 
workers, according to Maslach and Pines (1977) “are labeled in derogatory ways and 
treated accordingly” (p. 101). Personal Accomplishment burnout appears to be more 
responsive to the motivational needs of helping service professionals. Maslach and 
Jackson (1981a) described this form of burnout as “the tendency to evaluate oneself 
negatively, particularly with regard to one’s work with clients” (p. 1). 

Each of the variables used as predictors of burnout, namely work stress, job satis- 
faction, role clarity, and job challenge, was chosen on the basis of research findings 
which suggest a need for further examination of these variables as they relate to burn- 
out (Meier, 1984). 

The first variable explored as a predictor of burnout was overall work stress. While
various studies, such as those of Cedoline (1982), Friesen (1986), and Gmelch (1983),
have examined the relationship between stress and burnout, there is insufficient
evidence to describe the extent to which burnout as measured by the MBI is related to
work stress. This scarcity in the data base is even more pronounced in an educational
context. The concept of stress proposed by Cox and Mackay (1981), Lazarus and
Launier (1978), and McGrath (1976) was adopted for purposes of this study.
In this conceptualization stress is viewed as a distressful response to a demanding
situation that is perceived as exceeding the person’s resources and capabilities for
meeting the demand. This transactional concept of stress extends beyond the physiological
interpretation of stress identified by Selye (1956, 1974), and includes the fundamental 
everyday, interactive relationships among individuals, organizations, and environ- 
ments as leading to stress. 

Aspects of job satisfaction were also examined as predictors of administrator 
burnout. Maslach and Jackson (1981b) suggested that satisfaction and burnout are 
conceptually different, and that job satisfaction explains only a small percentage of 
the variance in any one subscale of burnout. Moreover, many studies have isolated 
overall job satisfaction as a predictor of burnout, without addressing those individual 
work conditions which contribute to both satisfaction and burnout. Recent research 
by Kottkamp and Travlos (1986) illustrated a significant negative correlation be- 
tween job satisfaction and Emotional Exhaustion burnout (r= —.61) for high school 
principals. Research by Anderson and Iwanicki (1984) indicates that specific facets of 
the job which contribute to satisfaction may also predict burnout. On the basis of 
these inconclusive findings, further research is needed to explore the extent to which 
identifiable factors of job satisfaction predict burnout among school-based adminis- 
trators. 

The third variable explored as a predictor of burnout in this study was role clarity 
(i.e., the absence of role ambiguity). Numerous research findings indicate a positive 
correlation between role ambiguity and stress and burnout. The evidence suggests 
that clearer and more informed job descriptions both mitigate levels of stress and 
burnout and contribute to job satisfaction (Cherniss, 1980a; Litt & Turk, 1985).
Nonetheless, the relationship between role clarity and burnout, particularly among 
school administrators, is somewhat tenuous as indicated by the research of Farkas 
(1984) and others. Further exploration of this relationship is warranted on the basis of 
these and other related research findings. 

Job challenge was examined as the fourth predictor of burnout among school- 
based administrators. Walsh, Taber, and Beehr (1980) described job challenge as the 
“degree to which the knowledge, skills and abilities of the role incumbent are per- 
ceived to be engaged or enlarged by the job” (p. 255). Studies by Holroyd and Lazarus 
(1982) indicate that increased job challenge both contributes to job satisfaction and 
assists in the alleviation of stress and burnout. However, the extent to which job chal- 
lenge predicts burnout among school-based administrators is largely speculative, and 
for this reason further research into this relationship is justified. 


Method 
Both quantitative and qualitative research techniques were used in order to investi- 
gate the nature of burnout among school-based administrators. These techniques re- 
flected the exploratory and descriptive design of the study, which was concerned with 
an identification of underlying variables that are representative of more specific and 
substantive states or conditions (Kerlinger, 1973). 


Sample 

A study was conducted of 763 educators (635 teachers and 128 administrators) in a 
large western Canadian school district (Sarros, 1986). The 128 school-based adminis- 
trators from this sample are the focus of this paper. These administrators consisted of 
66 principals and 62 assistant principals, representing a return rate of 82% and 77% 
respectively. Based on the results of t-tests which revealed no statistically significant 
differences in the mean scores of principals and assistant principals on each subscale
of burnout and for each predictor of burnout, the two groups were treated as one, the 
administrator group. 

About 83% of administrators were male. The average number of years as an educator
was 21. About 44% had 10 or fewer years of experience as an administrator, 45%
reported 11 to 20 years, and 11% had 21 or more years of administrative experience.
Approximately one-quarter of all administrators indicated their work was considerably
stressful, 8% very stressful, and 2% extremely stressful. Nonetheless, 62% were very
to extremely satisfied with their work. Forty-seven percent of administrators were
located in elementary schools, 28% in elementary-junior high schools, and 25% in
secondary schools.


Instrumentation 

A 72-item survey questionnaire was used in the collection of data. This questionnaire 
contained five sections: (a) respondent demographic information, including an item 
on overall work stress; (b) a job satisfaction instrument; (c) a job characteristics in- 
strument; (d) the MBI (Maslach & Jackson, 1981a); and (e) a personal comments sec- 
tion. Overall work stress was measured on a six-point scale ranging from one (1) for 
“Not Stressful” to six (6) for “Extremely Stressful.” 

The 26-item job satisfaction instrument used in this research was developed from 
the Minnesota Satisfaction Questionnaire: Short Form by Weiss, Dawis, England, and 
Lofquist (1967), the Satisfaction with Teaching and Employment Conditions Ques- 
tionnaire by Holdaway (1978), and the job satisfaction questionnaires by Gunn (1984) 
and Rice (1978). Administrators were requested to rate each item in response to the
statement “In your present position, indicate how you feel about each of these
aspects.” Responses on a six-point scale were as follows: 0=Dissatisfied; 1=Slightly
Satisfied; 2=Moderately Satisfied; 3=Considerably Satisfied; 4=Very Satisfied; and
5=Extremely Satisfied. The Spearman-Brown split-half reliability of the instrument,
established in pilot tests among school-based and district-based teachers and
administrators, as well as personnel from medicine and engineering, ranged
from .93 to .95. A principal components factor analysis using Varimax rotation and
iterations yielded seven factors. These factors accounted for 65.2% of the variance in
job satisfaction based on responses from school-based educators. The seven factors
and their constituent items were as follows:


1. Satisfaction with Status and Recognition refers to the sense of status and 
recognition associated with the job, and the intellectual stimulation and 
sense of accomplishment provided; 


2. Satisfaction with Autonomy refers to the freedom given educators to use 
their own judgments and methods; 

3. Satisfaction with Interpersonal Relationships refers to interactions with 
peers, subordinates, and superordinates; 

4. Satisfaction with Advancement refers to methods used in evaluation and 
promotion; 

5. Satisfaction with Security and Involvement refers to the items of job se- 
curity, and the chance to help other people; 

6. Satisfaction with Work Load refers to the amount of work and the extra 
tasks of teaching and/or administration; 

7. Satisfaction with Salary and Benefits refers to salary and fringe benefits. 
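
The Spearman-Brown split-half reliability reported for the job satisfaction instrument can be illustrated with a minimal Python sketch. The odd-even split of items, the function names, and the toy data are assumptions made for illustration only; the text does not say how the two halves were formed.

```python
import numpy as np

def spearman_brown(r_half):
    """Step a half-test correlation up to a full-length reliability estimate."""
    return 2.0 * r_half / (1.0 + r_half)

def split_half_reliability(items):
    """items: rows are respondents, columns are questionnaire items.

    Assumption: the halves are formed from odd- and even-numbered items.
    """
    items = np.asarray(items, dtype=float)
    odd_half = items[:, 0::2].sum(axis=1)
    even_half = items[:, 1::2].sum(axis=1)
    r_half = np.corrcoef(odd_half, even_half)[0, 1]
    return spearman_brown(r_half)

# Hypothetical ratings from four respondents on four items (0-5 scale).
toy_ratings = [[1, 2, 1, 2], [3, 3, 4, 3], [5, 4, 5, 5], [2, 2, 3, 2]]
print(split_half_reliability(toy_ratings))
```

The correction matters because the correlation between two half-length tests understates the reliability of the full-length instrument.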


These factors of job satisfaction were used as predictor variables of each subscale of 
burnout intensity as measured by the MBI.

The job characteristics instrument developed by Walsh, Taber, and Beehr (1980)
was used to measure the factors of Role Clarity and Job Challenge. Respondents were
asked to indicate the extent to which each component of the job characteristics
occurred on the job. Responses were on a six-point scale as follows: 0=Not at all;
1=Rarely; 2=Sometimes; 3=Frequently; 4=Often; and 5=Always. Walsh, Taber,
and Beehr (1980) reported reliability coefficients of .71 and .73 for the role clarity and
job challenge scales respectively. Content validity was established through studies by
Beehr, Walsh, and Taber (1976) and Walsh, Taber, and Beehr (1980).

An amendment to the job characteristics scale was the rewording of Item 3 on the 
role clarity scale to avoid reverse scaling. A factor analysis of the revised instrument 
revealed that each item loaded above 0.40 on the factors Role Clarity and Job Chal- 
lenge as indicated by the test developers. These two factors accounted for 69.1% of 
the variance in job characteristics based on responses from school-based teachers and 
administrators. The factors and their constituent items included the following: 
(a) Role Clarity refers to clearly-stated performance standards, goals, and directives; 
(b) Job Challenge refers to the opportunities provided educators to use their skills and 
abilities, and to keep learning new things on the job. These factors of job characteris- 
tics were used as predictor variables of each burnout subscale. 

The MBI developed by Maslach and Jackson (1981a, 1981b) was used to measure 
each subscale of burnout intensity for respondents in this study. The intensity dimen- 
sion only was chosen based on recommendations by Iwanicki and Schwab (1981). 
Burnout intensity was measured on an eight-point scale ranging from zero (0) for 
“Never” to seven (7) for “Major, Very Strong.” Maslach and Jackson (1981a) calcu- 
lated Cronbach’s coefficient alpha as a measure of internal consistency for each 
subscale. The coefficients for each subscale of burnout intensity were the following: 
Emotional Exhaustion (.87), Depersonalization (.76), and Personal Accomplishment 
(ia). 
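
Cronbach's coefficient alpha, the internal-consistency measure cited above for each burnout subscale, can be sketched in a few lines of Python. The function name and the toy response matrix are hypothetical and are not data from the study.

```python
import numpy as np

def cronbach_alpha(items):
    """items: rows are respondents, columns are the items of one subscale."""
    items = np.asarray(items, dtype=float)
    k = items.shape[1]                          # number of items in the subscale
    item_variances = items.var(axis=0, ddof=1)  # sample variance of each item
    total_variance = items.sum(axis=1).var(ddof=1)
    return (k / (k - 1)) * (1.0 - item_variances.sum() / total_variance)

# Hypothetical intensity ratings (0-7 scale) from four respondents on two items.
toy_subscale = [[1, 2], [2, 1], [3, 3], [4, 4]]
print(cronbach_alpha(toy_subscale))
```

Alpha approaches 1 when the items of a subscale rise and fall together across respondents, which is the sense in which it indexes internal consistency.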

A factor analysis of the responses of educators on the Maslach Burnout Inventory 
revealed that every item in the 22-item inventory loaded above 0.40 on the factor indi- 
cated by the test developers. The three factors of Emotional Exhaustion, 
Depersonalization, and Personal Accomplishment burnout accounted for 51.5% of 
the variance in burnout intensity. The intercorrelations among all factors and burn- 
out subscales for administrators are presented in Table 1. 

The personal comments section included three questions derived from Maslach 
and Jackson’s (1981a) definition of the three dimensions of burnout as follows:
(a) Which two factors in your work contribute most to feelings of being overextended 
and exhausted by your work? (b) Which two factors in your work contribute most to 
the development of negative attitudes and impersonal responses toward clients? 
(c) Which two factors in your work contribute most to feelings of a lack of personal ac- 
complishment on the job? 


Assumptions and Limitations 

One of the assumptions for this study was that burnout and related concepts are 
persistent feelings which can be measured in a statistical manner, and that burnout is 
a negative feeling in the respondent. Further assumptions were that the respondents 
provided accurate responses to the items, and that the resultant scores are reliable es- 
timates of the degree of burnout experienced by the respondents. These assumptions 
place limitations on the degree to which the findings of the study are truly representative
of burnout in these school administrators.


TABLE 1


Pearson Correlation Coefficients Between Overall Work Stress,
Job Satisfaction, Job Characteristics, and Burnout Subscales for
Administrators (N=128)


Work Stress, Factors,      Overall
and Burnout Subscales      Work Stress   1   2   3   4   5   6   7   8   9   EE   DEP   PA


Job Satisfaction Factors 


1. Status and =a 1.00 
Recognition 
2. Autonomy Salis 78 1.00 
3. Interpersonal — 19 59 D3 1.00 
Relationships 
4, Advancement .00 48 43 28 1.00 
5. Security and Alls} 2 48 47 29 1.00 
Involvement : 
6. Work Load 100) .62 coll 33 42 30 1.00 
7. Salary and 01 .30 ea ai JH 123 .34 1.00 
Benefits 
Job Characteristics Factors 
8. Role Clarity = be! 54 .40 42 31 32 50 stil 1.00 
9. Job —.02 54 DO 40 ee, 45 Oo i? 53 1.00 
Challenge 
Burnout Subscales 
Emotional Uke 7A) 23 .24 -l) Ds ee) iy Peay eNO) 1.00 
Exhaustion (EE) 
Depersonalization a) Pal 26 20 .07 3S) 08 04) yealiay e eeal9 52 1.00 
(DEP) 
Personal Accom- 00 at 36 40 04e a 0 8 eS LOT F138) 8 2204 .30 40 1.00 
plishment (PA) 
Data Analysis 


The major statistical technique used in this study was multiple stepwise linear regres- 
sion analysis. Weiss (1976) mentioned that the incremental procedure of multiple re- 
gression works from the “bottom up.” In this procedure, variables are added which 
have the highest relationship with the criterion variable, as indicated by their partial 
correlations with the criterion. As variables are added, the multiple correlation is 
recomputed at each step until the increments in the multiple correlation coefficient 
(R) become no longer statistically significant. In this study, the criterion variable was 
represented by each subscale of burnout intensity. 
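
The incremental procedure described above can be sketched as a forward stepwise selection in Python. This is a minimal illustration under stated assumptions, not the authors' analysis: the fixed `f_to_enter` threshold stands in for a significance test of the increment, and the variable names and synthetic data are hypothetical.

```python
import numpy as np

def r_squared(X, y):
    """R^2 of an ordinary least squares fit of y on X, with an intercept."""
    A = np.column_stack([np.ones(len(y)), X])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    resid = y - A @ coef
    return 1.0 - float(resid @ resid) / float(((y - y.mean()) ** 2).sum())

def forward_stepwise(X, y, names, f_to_enter=4.0):
    """Add one predictor per step; stop when the increment in R^2 is too small.

    Assumption: f_to_enter is an illustrative cut-off for the F statistic
    of the increment, used here in place of an exact p-value.
    """
    n = len(y)
    selected, steps, r2_current = [], [], 0.0
    remaining = list(range(X.shape[1]))
    while remaining:
        # The candidate with the largest R^2 is also the one with the highest
        # squared partial correlation with the criterion variable.
        best_j, best_r2 = None, r2_current
        for j in remaining:
            r2 = r_squared(X[:, selected + [j]], y)
            if r2 > best_r2:
                best_j, best_r2 = j, r2
        if best_j is None:
            break
        k = len(selected) + 1
        f_stat = (best_r2 - r2_current) / (max(1.0 - best_r2, 1e-12) / (n - k - 1))
        if f_stat < f_to_enter:
            break
        selected.append(best_j)
        remaining.remove(best_j)
        r2_current = best_r2
        # Record predictor name, multiple R, and cumulative R^2 at this step.
        steps.append((names[best_j], round(best_r2 ** 0.5, 3), round(best_r2, 3)))
    return steps

# Synthetic data: the criterion depends on the first and third predictors.
rng = np.random.default_rng(0)
X_demo = rng.normal(size=(128, 3))
y_demo = 2.0 * X_demo[:, 0] - 1.0 * X_demo[:, 2] + 0.5 * rng.normal(size=128)
for step in forward_stepwise(X_demo, y_demo, ["stress", "autonomy", "work_load"]):
    print(step)
```

Each printed step mirrors one row of a stepwise regression table: the predictor entered, the multiple R, and the cumulative R squared.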

Qualitative data were subjected to content analysis as outlined by Bogdan and 
Biklen (1982). Content analysis is used to tabulate data or information in a meaning- 
ful, systematic, and objective manner. According to Carnoy (1979, p. 64) content an- 
alysis is appropriate “when source material is used to complement some other kind of 
data during an inquiry into attitudes.” 

As a first step, the content analysis of a random sample of open-ended responses 
by two independent coders yielded categories and subcategories of sources of burnout. 
The interrater reliabilities for the coding of these responses ranged from .83 to .88. 
Once these categories were established, all the questionnaires were included in the
analysis of responses to the three open-ended questions. The total frequency of each
category and the frequency with which each category appeared in each burnout scale
were determined. These findings were then compared with those obtained in the
regression analysis of the quantitative data.


Findings
The Nature of Burnout Among Administrators 
Maslach and Jackson (1981a) established norms and cut-off points for low, moderate, 
and high levels of burnout based on their studies of 2,118 helping service profession- 
als. These cut-off points were used to establish a normative distribution, with one- 
third of the respondents in each of the low, moderate, and high burnout categories. 
The cut-off points and distributions so established were used to determine the
percentage distributions of respondents in this study in each burnout subscale, as
presented in Table 2. For consistency, the scores for Personal Accomplishment
burnout were reversed. Thus administrators who reported relatively high levels of 
burnout would register higher than normal scores on each burnout subscale. As indi- 
cated in Table 2, the mean scores of administrators for the first two subscales of burn- 
out were lower than their corresponding norms. Fewer than 10% of administrators 
reported high degrees of Emotional Exhaustion and Depersonalization burnout, com- 
pared with 33.6% who reported a high degree of Personal Accomplishment burnout. 
These findings indicate that school-based administrators tended to experience less 
burnout compared with other groups of helping service professionals. 
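
The classification of respondents into lower, middle, and upper thirds can be illustrated with a short Python sketch. The cut-off values, the boundary rule, and the `7 - score` reversal for Personal Accomplishment are assumptions for illustration; the text does not report the actual cut-off points or the reversal formula.

```python
import numpy as np

def reverse_score(scores, scale_max=7):
    """Reverse a 0..scale_max score so that higher values mean more burnout.

    Assumption: reversal is scale_max minus the score.
    """
    return scale_max - np.asarray(scores, dtype=float)

def percent_in_thirds(scores, low_cut, high_cut):
    """Percentage of respondents in the lower, middle, and upper norm groups.

    Assumption: scores at or below low_cut are 'lower', scores above
    high_cut are 'upper', and everything in between is 'middle'.
    """
    scores = np.asarray(scores, dtype=float)
    lower = 100.0 * np.mean(scores <= low_cut)
    upper = 100.0 * np.mean(scores > high_cut)
    middle = 100.0 - lower - upper
    return tuple(round(float(p), 1) for p in (lower, middle, upper))

# Hypothetical subscale means for six respondents and hypothetical cut-offs.
toy_scores = [1, 2, 3, 4, 5, 6]
print(percent_in_thirds(toy_scores, low_cut=2, high_cut=4))
```

A sample that matched the norm group would show about one-third of respondents in each category, so departures from that split indicate more or less burnout than the norm.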


Predictors of Burnout Among Administrators 
The results of multiple stepwise linear regression analysis for prediction of burnout in 
administrators are presented in Table 3. 

For Emotional Exhaustion burnout, Overall Work Stress was the major statisti- 
cally significant predictor, contributing 23% to the total variance in this subscale. 
“Satisfaction with Work Load” contributed an additional 5% to the variance in 
Emotional Exhaustion burnout. As indicated by the negative correlation coefficient, 
the less satisfied administrators were with their work load, the more they tended to 
experience Emotional Exhaustion burnout. 

Depersonalization burnout was best predicted by Satisfaction with Status and 
Recognition, followed by Overall Work Stress and Satisfaction with Work Load. 
When taken together these factors contributed 14% to the total variance in this 
subscale. 

Satisfaction with Interpersonal Relationships contributed 16% to the variance in
Personal Accomplishment burnout in administrators. The predictors combined
contributed 28% of the variance in this subscale of burnout. In no case did Role Clarity
or Job Challenge enter as a statistically significant predictor of burnout.


Work Situations Contributing to Burnout 

The frequency with which work conditions were identified by administrators as con- 
tributing most to feelings and attitudes associated with burnout is presented in 
Table 4. As shown in the first column, interpersonal relationships were mentioned 104 
times by the 128 administrators, followed by perceived pressures from higher author- 
ities (69), work load (64), attitudes of parents and public (56), insufficient time (49), 
negative feedback (40), excessive demands (37), interactions with students (36), and 
self-concept (25). 
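
The counts and percentages in Table 4 appear to be related in a simple way: each percentage is the number of administrators citing a condition divided by the sample of 128, and each total is the sum across the three burnout dimensions. The Python sketch below checks that reading for a few rows; the function name and the half-up rounding rule are assumptions.

```python
N_ADMINISTRATORS = 128

def percent_of_sample(count, n=N_ADMINISTRATORS):
    """Whole-number percentage of the sample, rounding halves up (assumed)."""
    return int(100 * count / n + 0.5)

# Counts as printed in Table 4, in the order Emotional Exhaustion,
# Depersonalization, Personal Accomplishment.
table4_counts = {
    "Interpersonal relationships": (33, 48, 23),
    "Perceived pressures from authorities": (33, 13, 23),
    "Insufficient time": (23, 13, 13),
    "Self-concept": (0, 13, 12),
}

for condition, counts in table4_counts.items():
    total = sum(counts)
    percents = [percent_of_sample(c) for c in counts]
    print(condition, total, percents)
```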

Further analysis revealed that interpersonal relationships were viewed as major 
contributors to all three dimensions of burnout. Thirty-eight percent of administra- 
tors claimed these relationships contributed to feelings of Depersonalization burnout. 
A further 26% claimed interpersonal relationships led to Emotional Exhaustion 
burnout, and 18% identified Personal Accomplishment burnout as a result of unsatisfactory
interpersonal relationships. Selected illustrative comments of administrators,
all with at least 15 years of experience, are provided as further insights into work
conditions contributing to burnout.


TABLE 2


Comparison of Administrator Sample with Established Norms
of Burnout Intensity (N=128)


                           Lower    Middle   Upper
Burnout Subscale           Third    Third    Third    Mean    Norm
Emotional Exhaustion       66.4a    28.1      5.5     2.54    3.02
Depersonalization          55.5     35.1      9.4     1.42    2.34
Personal Accomplishment    28.1     38.3     33.6b    2.02    2.04


Note: a This means that 66.4% of administrators indicated a low degree of Emotional
        Exhaustion burnout.
      b This means that 33.6% of administrators indicated a high degree of Personal
        Accomplishment burnout.


TABLE 3


Stepwise Multiple Regression Analysis
for Prediction of Burnout Among Administrators
(N=128)


Dependent
Variable            Predictors                                      B            r            R            R²
Emotional           Overall Work Stress                             .473         .48          .48          .23
Exhaustion          Satisfaction with Work Load                     [illegible]  [illegible]  .53          .28
Depersonalization   Satisfaction with Status and Recognition        [illegible]  [illegible]  [illegible]  .07
                    Overall Work Stress                             .308         .23          [illegible]  [illegible]
                    Satisfaction with Work Load                     [illegible]  [illegible]  [illegible]  .14
Personal            Satisfaction with Interpersonal Relationships   [illegible]  -.40         .40          .16
Accomplishment      Satisfaction with Security and Involvement      -.309        -.38         .46          [illegible]
                    Advancement
                    Autonomy


(Constant) 4.684


TABLE 4


Work Conditions Identified by Administrators
as Contributing to Burnout
(N=128)


                                                    Burnout Dimensions
Work
Conditions                                Emotional                          Personal
Identified                      Total     Exhaustion    Depersonalization    Accomplishment
                                          N      %      N      %             N      %
Interpersonal relationships     104a      33b    26c    48     38            23     18
Perceived pressures              69       33     26     13     10            23     18
  from authorities
Work load                        64       49     38      0      0            15     12
Attitudes of parents             56       20     16     25     20            11      9
  and public
Insufficient time                49       23     18     13     10            13     10
Negative feedback                40        3      2      6      5            31     24
Excessive demands                37       [illegible]
Interactions with students       36       15     12     13     10             8      6
Self-concept                     25        0      0     13     10            12      9
Changing curricula                3        3      2      0      0             0      0


a “Interpersonal relationships” were mentioned 104 times as major contributors to burnout by 128
  administrators.
b “Interpersonal relationships” were mentioned by 33 administrators as major contributors to
  Emotional Exhaustion burnout.
c “Interpersonal relationships” were mentioned by 26% of the administrators as major
  contributors to Emotional Exhaustion burnout.

Interpersonal relationships. The quality and intensity of interpersonal relationships
contributed to each form of burnout among school-based administrators. Of
particular concern was the conflict caused by differences in ideologies and values be- 
tween administrators and their staff. For instance, a male elementary school principal 
wrote of teachers who were “unwilling to accept their responsibilities as professionals” 
as leading to his feelings of cynicism and negativism associated with Deperson- 
alization burnout. This finding supports Cherniss’ (1980b) claims that “differences in 
personal values” contribute to conflict, mistrust, and burnout among helping service 
professionals (p. 121). 

Feelings of Emotional Exhaustion burnout occurred because of the “constant 
bombardment of people,” according to a male senior high assistant principal. A male 
junior high principal commented on the “lack of cooperation among your staff” as con- 
tributing to Personal Accomplishment burnout. The finding that stressful interper- 
sonal relationships contributed to each type of burnout among school administrators 
is consistent with the claims of Welch, Medeiros, and Tate (1982) that a “principal’s 
interactions with people are almost always negative” and contribute to burnout 
(p. 47). 

Perceived pressures from authorities. The policies and directives from central 
office, local school boards, and the Department of Education contributed substantial- 
ly to feelings of Emotional Exhaustion and lack of Personal Accomplishment. A fe- 
male elementary-junior high principal was confused and exhausted by “too many 
demands outside the school (central office, etc.) ... [which] take me away from the re- 
sponsibilities of my school.” A male elementary school principal wrote of the “lack of 
input into major decisions” as contributing to feelings of Personal Accomplishment 
burnout. 

Work load. Administrators identified an excessive work load as leading to feelings 
of Emotional Exhaustion and Personal Accomplishment burnout. A male elementary 
school principal claimed exhaustion by “Working in an environment where everything
should have been completed ‘yesterday.’” The paper “blizzard,” the unrelenting pressure
of deadlines, and the need to satisfy a variety of clients and to complete numerous
administrative duties led one male elementary principal to “feel [I am] not doing
an outstanding job.” These findings reveal that the demanding nature of school ad- 
ministration is not only a fact of school life as documented by Willis (1980) and 
Wolcott (1973), but is also a major contributor to stress and feelings of burnout as 
found by MacPherson (1985) and Ratsoy and Friesen (1985). 

Attitudes of parents and public. Some administrators indicated that the increas- 
ing and often unreasonable expectations of parents and various publics contributed to 
the development of negative attitudes and cynical responses in themselves. Cherniss 
(1980b), Freudenberger (1977) and Maslach and Pines (1977) identified these 
responses as indicators of Depersonalization burnout. As an example, a male 
elementary-junior high school principal commented on “parents who do not give 
schools credit or trust” as leading to feelings of Depersonalization. 

Insufficient time. The perceived lack of time to accomplish tasks led to each form 
of burnout, but predominantly to Emotional Exhaustion burnout. A male 
elementary-junior high assistant principal was exhausted by “[not] having enough 
time to accomplish (and do well) the things that can be done.” The “pressure of dead- 
lines” contributed to feelings of negativism and cynicism in a female elementary 
school principal. A male elementary-junior high principal experienced feelings of 
Personal Accomplishment burnout because of “the lack of time to really sit down and 
discuss issues with staff.”

Negative feedback. About one-quarter of the administrators claimed that inadequate
or negative feedback contributed to Personal Accomplishment burnout. The
comments of a female elementary school assistant principal are fairly representative
of this attitude: “Only hear negative responses. Not often given ‘a pat on the back.’”
This lack of feedback reinforced the beliefs of many administrators that their jobs 
were often unrecognized, unrewarded, and always criticized. Cherniss (1980b) 
stressed the importance of public recognition and feedback from peers as “critical re- 
sources” in helping to prevent burnout. Maslach (1982, p. 19) claims that professional 
helpers “don’t hear much when things are going right, but they sure hear plenty when 
things are going wrong.” Negative feedback was not cited as a major contributor to the 
other two forms of burnout. 

Excessive demands. The intraschool factors of resource availability, physical 
working conditions, and staff needs contributed somewhat to all three forms of burn- 
out. A female elementary-junior high principal identified school size as a contributor 
to Depersonalization burnout: “Too many demands which take me away from the per- 
sonal side . . . The size of the school contributes to this.” Feeling powerless to alter sig- 
nificantly one’s working conditions, Cherniss (1980b) maintained, “would contribute 
to greater stress and the maladaptive coping pattern associated with .. . burnout” 
(p. 100). 

Interactions with students were seen as contributing somewhat to all three dimensions
of burnout. Self-concept concerns contributed somewhat to Depersonalization
and Personal Accomplishment burnout. Changing curricula was mentioned only three 
times as a source of burnout. 


Discussion 

The Nature of Burnout 

The findings of this study as indicated in Table 2 reveal that school-based administra- 
tors recorded lower overall mean scores than the established norms for Emotional 
Exhaustion and Depersonalization burnout, and a mean score similar to the norm for 
Personal Accomplishment burnout. The administrator distribution when compared 
with the normative distribution of burnout revealed that most administrators experi- 
enced low to moderate levels of each form of burnout. These findings are consistent 
with the research of Gorton (1982), Hiebert and Basserman (1986), and Ratsoy and 
Friesen (1985) which reveal that school-based administrators are experiencing lower 
levels of stress and burnout than previously expected. Apparently the oft-heard claim 
that school administrators are “survivors” in the school system cannot be dismissed 
too lightly. School administrators may have learned how to cope with the demands of 
administration and the various developments in their career paths. These survival 
skills may in part be conditional on an informed social support network (Ratsoy et al., 
1986, p. 280) which helps them deal with work stress and burnout. Future research 
may explore the strategies administrators use to help them cope with burnout on the
job.


Predictors of Burnout Among Administrators 

The findings of this study are consistent with other research findings that work stress 
and burnout are related (Blase, 1982; Gmelch, 1983; Kottkamp & Travlos, 1986). 
Nonetheless, this study has added new evidence to this relationship by indicating that 
work stress in combination with other work factors contributes differentially to the 
variance in each subscale of burnout. In particular, more of the variance in Emotional 
Exhaustion burnout compared with the other two subscales is explained by Overall 
Work Stress and Satisfaction with Work Load. The significant negative correlation 
between satisfaction with work load and stress (r = -.39, p < .01, Table 1) and the
contribution of these two variables to burnout are fairly well documented in the litera- 
ture and research (Blase, 1986; Farber, 1984; Friesen, 1986; Gmelch, 1983; Kottkamp 
& Travlos, 1986). 

The differences in the etiologies of each burnout subscale may be further under- 
stood by comparing the predictors of Emotional Exhaustion burnout with those for 
Depersonalization and Personal Accomplishment burnout. In the case of the latter 
two subscales, the major predictors were Satisfaction with Status and Recognition 
and Satisfaction with Interpersonal Relationships respectively. These two subscales 
may be more the result of a job which is perceived as failing to satisfy the motivational 
needs of recognition, feedback, and sense of accomplishment among administrators, 
rather than the result of work stress as is Emotional Exhaustion burnout. Maslach’s 
(1982) belief that failure of the job to satisfy the needs to be “admired and respected 
by other people” (p. 65) contributes to burnout is borne out by the findings of this 
study. Although Overall Work Stress and Satisfaction with Work Load contributed a 
further 6.4% to the variance in Depersonalization burnout, the total variance explained
was less than 14%. Other work factors not measured by the research instruments
used in this study may contribute to this form of burnout, and additional
research examining these work factors and their relationships to burnout is
recommended.

The finding that Satisfaction with Interpersonal Relationships and other contex- 
tual factors contributed to Personal Accomplishment burnout concurs with various 
studies. Welch et al. (1982) claimed that stressful interpersonal relationships are re- 
lated to the lack of sufficient training in human relations skills. This deficiency, the 
authors maintained, “condemn[s] the principal to at least partial failure all the time,
and contributes to burnout” (p. 48). When these interpersonal interactions are com- 
bined with the frustrations which arise from a job that restricts involvement, auton- 
omy, and advancement, it is not surprising that some school administrators begin to 
doubt their achievements on the job. A higher than normal level of Personal Accom- 
plishment burnout in many cases, as the results of this study reveal, would therefore 
not be unexpected. Many school systems may have to be prepared for the challenge of 
guarding against this tendency of administrators to undervalue their accomplish- 
ments at work. 

The results of this study revealed that neither Role Clarity nor Job Challenge was
a statistically significant predictor of burnout for administrators. The finding that
Role Clarity did not predict burnout is consistent with research by Bedeian, 
Armenakis, and Curran (1981) who found no significant relationships among role 
ambiguity, organizational level of responsibility, and stress for 202 nursing personnel. 
Further explanation of the tenuous relationships among Role Clarity, Job Challenge, 
and burnout may be derived from the findings of Walsh et al. (1980). Their study of 
957 employees in a large manufacturing company indicated that when used together 
with job satisfaction, “more of the variance in job satisfaction can be accounted for by 
considering the effects of challenge and role clarity simultaneously than by consider- 
ing either one separately” (p. 255). In the present study, factors of job satisfaction may 
have explained most of the variance in burnout otherwise attributable to Role Clarity 
and Job Challenge. Future research, therefore, may explore the influence on burnout 
of both these factors when used independently of job satisfaction.


Work Conditions Contributing to Burnout

Some striking similarities are obvious when the work conditions administrators iden- 
tified as major contributors to burnout are compared with the findings of the regres- 
sion analysis. For instance, work load was the key contributor to feelings of Emotional 
Exhaustion burnout as identified by administrators, while Overall Work Stress and 
Satisfaction with Work Load were the significant predictors of this subscale. The fact 
that both quantitative and qualitative data are consistent in identifying the etiology 
of Emotional Exhaustion burnout (as well as the etiologies of the other subscales) pro- 
vides evidence in support of triangulated research methodologies and their enrich- 
ment of research findings. The relationships among work load, stress, and burnout 
revealed in this study are similar to the findings of Blase (1986), Cedoline (1982), and 
Gmelch and Swent (1984). 

Work load and lack of sufficient time to complete tasks are related. Blase (1986) 
suggested that “time may be one of the most important and neglected variables in un- 
derstanding stress” (p. 23), and that it “cannot be understood as independent of other 
sources of stress” (p. 28). Administrators indicated that both an increased work load 
and insufficient time contributed to each form of burnout. These findings fit the pat- 
tern of “burned-out” administrators described by Welch et al. (1982): “Faced with too 
much to do, with declining resources and decaying autonomy, the satisfactions associ- 
ated with being a principal are being eroded . . . the result—burned-out principals” 
(p. 45). 

The findings also indicate that Maslach and Jackson’s (1981a, 1981b) conceptual- 
ization of burnout is related to the transactional concept of stress as developed by a 
number of researchers. From the transactional perspective, McGrath (1976, p. 1352) 
defined stress as the result of a situation presenting a demand which is perceived as 
exceeding the person’s capabilities for meeting that demand. In this study, adminis- 
trators felt the constant shortage of time was stressful. Their failure to alleviate this 
stressful response contributed to burnout. Many administrators identified interper- 
sonal relationships as a major contributor to negative attitudes and impersonal 
responses associated with Depersonalization burnout. Other work factors associated 
with this type of burnout were the unfavorable attitudes of parents and various 
publics, time constraints, and excessive demands. These qualitative data are 
consistent with the major predictors of Depersonalization burnout (i.e., Satisfaction
with Status and Recognition, Overall Work Stress, Satisfaction with Work Load). The 
school administrator’s role as an intermediary between the collective interests of the 
school and the individual concerns of teachers, students, and parents places him or 
her in stressful situations. These stressful situations, Maslach (1982) claimed, con- 
tribute to burnout because “they are another source of emotional stress that adds to 
the development of emotional exhaustion and negative feelings about people” (p. 42). 

Administrators claimed that negative feedback was the main contributor to 
Personal Accomplishment burnout. Pines, Aronson, and Kafry (1981, p. 118) recog- 
nized that feedback in the form of rewards and significance is a potential buffer 
against burnout. In the case of this study, insufficient or negative feedback was re- 
lated to Personal Accomplishment burnout. The other work features identified by ad- 
ministrators as leading to Personal Accomplishment burnout—that is, interpersonal 
relationships and the perceived pressures from higher authorities—were similar to 
the predictors of this subscale (i.e., Satisfaction with Interpersonal Relationships, 
Security and Involvement, Advancement, and Autonomy). Administrators experi- 
enced Personal Accomplishment burnout as a result of the belief that the system was 
thwarting their opportunities for advancement and participation. This finding
supports Maslach’s (1982) assertion that failure of the job to fulfill the needs of autonomy
and control makes people feel “helpless, powerless, and trapped . . . [and] the
betting is that they will burn out” (p. 66). 

The qualitatively derived data, though supportive of the findings of the quantitative
analysis, were not entirely in agreement with them. First, they
did not separate out the perceived sources of each dimension of burnout. For example,
satisfaction with interpersonal relationships entered as a predictor of only one dimen- 
sion of burnout in the regression analysis. In the open-ended responses, interpersonal 
relationships were related to all three burnout dimensions. Second, in the qualitative 
analysis, all three forms of burnout were linked, to a degree at least, to practically 
every work condition identified. Such discrepancies raise serious issues about the na- 
ture of burnout and the measurement of sources of burnout among administrators. It 
should be noted that the regression analysis reveals only statistically significant infor- 
mation, and that the much-needed qualitative approaches to the study of burnout 
could contribute additional understandings. Regardless, the findings of the study re- 
ported in this paper are useful and revealing. However, in view of the discrepancies 
between the two methods and the limitations imposed by the research design, they 
should be used primarily to assist in the further development of research questions 
and research designs. 


Concluding Note 

Generally, the findings of this study are gratifying. School-based administrators ap- 
peared to be coping quite well with the rigors of their jobs, and reported only moder- 
ate to low levels of Emotional Exhaustion and Depersonalization burnout in the 
majority of cases. A major cause of concern in over a third of all administrators, how- 
ever, was that they experienced a higher than normal level of Personal Accomplish- 
ment burnout. The indications from this study are that this type of burnout is most 
closely associated with the failure of the job to provide positive feedback and to fulfill 
adequately the needs of status, recognition, and satisfying interpersonal relation- 
ships. Further, these findings tend to support those of Brookings, Bolton, Brown, and 
McEvoy (1985), who noted that “personal accomplishment is largely independent of
emotional exhaustion and depersonalization” (p. 149). Research may explore whether 
increased positive feedback as well as training in human relations skills would reduce 
Personal Accomplishment burnout among school administrators. The findings of this 
research also indicate that these strategies may assist in buffering the onset of 
Depersonalization burnout. Future research could examine the extent to which the 
motivational needs of administrators are critical determinants of burnout and con- 
centrate less on the impact of job-specific factors as contributing to administrator 
burnout. 
Relationships were examined between student teachers’ levels of cognitive development
and their (a) perceptions of their relationships with cooperating teachers;
(b) evaluations of cooperating teachers as supervisors; and (c) cooperating teachers’ 
evaluations of the student teachers. At the beginning of a semester, 51 student teachers 
completed objective tests measuring the four stages of cognitive development defined 
by Conceptual Systems Theory, and scales measuring dispositional (personality) 
stress. At the end of the semester, subjects completed the Barrett-Lennard Relation- 
ship Inventory and a scale evaluating their cooperating teacher. At the end of the se- 
mester, cooperating teachers also evaluated the student teachers. Student teachers 
gave highest ratings to cooperating teachers whom they perceived as empathetic and 
friendly. Student teachers who appeared to be concrete (rather than abstract) thinkers 
tended to be most satisfied with their cooperating teachers. In turn, cooperating 
teachers assigned highest ratings to student teachers who appeared to be non- 
achievement oriented, and who were comfortable with rigid patterns of behavior. Re- 
sults are interpreted to reflect a preference for “facilitative” rather than for “delibera- 
tive” supervision during student teaching. 


The purpose of this study was to examine the significance of cognitive level, as defined 
by Harvey, Hunt, and Schroder’s (1961) Conceptual Systems Theory, to one aspect of 
student teachers’ field experiences. How does the cognitive level of student teachers 
affect their perceptions and evaluations of their cooperating teachers? Is the cognitive 
level of student teachers significantly related to cooperating teachers’ evaluations of 
the student teachers’ competencies? Underlying these questions was the assumption 
that cognitive level affects the way individuals perceive, organize, and evaluate infor- 
mation, including information about others. Because perception has an evaluative 
component, it may indirectly affect the way relationships between individuals are 
perceived. 


Dr. Kagan is currently an associate professor of teacher education who enjoys designing and conducting cooperative research 
projects with local school districts. 
Conceptual Systems Theory (CST) defines cognitive development in terms of an
increased ability to think abstractly, as well as specific changes in personality and per- 
ception. According to CST, as individuals progress from simple (concrete) thinking to 
complex (abstract) thinking, they also move from one specific set of beliefs to another. 

According to the CST model, there are four developmental stages, or conceptual 
systems, each associated with characteristic beliefs. Individuals representing 
System 1 tend to be extremely rigid, absolute in their judgments, and in need of struc- 
ture. This first stage is also associated with a compulsive concern with religion and a 
reliance on authority. Individuals representing System 2 tend to be rebellious, to re- 
ject social convention, and to have an unusually pervasive distrust of people. 
Prototypes of System 3 are concerned with affect: the feelings of others, friendship, 
and social acceptance. They also tend to feel special and to have a strong internal 
locus of control. Finally, representatives of System 4 can be described by relativism, 
flexibility, low evaluativeness, and the tendency to use multiple perspectives in per- 
ceiving information. Individuals at this last stage tend to rely on internal standards 
and a sense of personal competence, and to be concerned with balance and equity in 
social interactions. 

Level of cognitive development, as defined above, has been related to ways of pro- 
cessing information. Studies have shown that subjects at abstract levels tended to dif- 
ferentiate, to encode, and to infer from information to a greater degree than subjects 
at concrete levels (Sieber & Lanzetta, 1966). In experiments where subjects were pre- 
sented with contradictory pieces of information about hypothetical individuals, con- 
crete subjects reacted more negatively toward the contradictions and tended to use a 
single process for resolving them (Crano & Schroder, 1967). Subjects functioning at 
lower, more concrete developmental levels also tended to be more evaluative and dog- 
matic in their attitudes than their abstract counterparts (Harvey & Ware, 1967). 
One’s level of cognitive development, as defined by CST, appears to describe a great 
deal more than simply the ability to think abstractly. In a very literal sense, it may af- 
fect the way one perceives and evaluates people and behaviors and the way one is 
likely to resolve conflict and contradiction. 

CST appears to have special relevance for the field of education, because a grow- 
ing body of empirical research has related some of its dimensions to the behaviors and 
effectiveness of teachers at all levels of education. The cognitive variable most fre- 
quently examined in these studies has been the tendency (of teachers) to be concrete 
or abstract in their thinking. Scores obtained on a projective test of this dimension 
have been used to dichotomize teachers as concrete versus abstract thinkers. In gen- 
eral, abstract teachers have been found to be more flexible, to encourage more student 
involvement, to have a greater diversity of activities in their classes, and to have a 
greater tolerance for stress than concrete teachers (e.g., Harvey, White, Prather, 
Alter, & Hoffmeister, 1966; Hunt & Joyce, 1967; Miller, 1978). 

The relevance of cognitive level, as defined by CST, to the student teaching expe- 
rience can be inferred from recent studies by Tardif (1985) and Campbell (1985). 
Tardif’s in-depth study of student teachers’ attitudes and thoughts through three se- 
mesters of teaching revealed basic “perspectives,” or themes, common to the entire 
student teaching experience. Cognitive level, as a construct, is clearly related to two of 
these themes: Seeing Oneself as a Teacher, and Taking the Path of Least Resistance. 
To See Oneself as a Teacher involves the ability of student teachers to perceive their 
cooperating teachers’ point of view. The ability to assume another’s perspective has 
been theoretically and empirically related to higher levels of cognitive development 
(Harvey et al., 1961; Harvey & Ware, 1967; Sieber & Lanzetta, 1966). Taking the Path
of Least Resistance refers to student teachers’ attempts to avoid conflict with cooperating
teachers. The perception of inconsistency and conflict (that is, whether one
labels a situation as such, and how negatively one may respond to it) has also been
empirically related to cognitive level (Crano & Schroder, 1967). Thus one can logically 
assume that student teachers’ cognitive level (as defined by CST) would indirectly af- 
fect how they evaluate themselves and how they perceive their cooperating teachers. 

Campbell (1985) examined factors that influenced student teachers’ actual class- 
room behavior. Self-report data indicated that the one factor given priority in all 
phases of teaching was “student teachers’ ideas, beliefs, and characteristics.” If one 
considers the attitudes associated with each level of cognitive development in CST as 
being pervasive characteristics, then the relevance of CST to the student teaching ex- 
perience is clear—the ideas, beliefs, and perceptions related to each cognitive level 
play a key role in determining the actual classroom behaviors of student teachers. 
These behaviors, in turn, are likely to influence the attitudes and behaviors of cooper- 
ating teachers. Thus the quality of the interpersonal relationships that exist between 
student teachers and cooperating teachers is indirectly affected by student teachers’ 
cognitive levels. 

The Barrett-Lennard Relationship Inventory (Barrett-Lennard, 1962) is a self-re- 
port measure designed to assess four elements that Rogers (1962) described as neces- 
sary for any effective helping relationship: (a) Congruence: the perception that the 
other person is genuine and without facade; (b) Empathy: the feeling that the other 
person is experiencing an accurate and sensitive awareness of you; (c) Positive Re- 
gard: the feeling that the other person is warm, accepting, and cares for you; 
(d) Unconditioned Regard: the perception that acceptance is total rather than condi- 
tional. 

Though the Barrett-Lennard Inventory was developed to assess the relationship 
between a client and a therapist, it has been used extensively in educational contexts. 
Several researchers have found significant positive correlations between students’ 
responses on the Barrett-Lennard and their academic gains, as well as the amount 
they communicated with their teachers (Emmerling, 1961; Lewis, Lovell, & Jesse, 
1965). Indeed, theorists have claimed that learning cannot occur unless pupils perceive
that an effective helping relationship exists with their teacher (Bush, 1959;
Rogers, 1962; Wingo, 1960). If pupils learn most effectively and communicate more in 
the framework of a good relationship, one can assume the same would be true of 
student teachers who are, in a sense, pupils of their cooperating teachers. The present 
study represents a unique attempt to use the Barrett-Lennard Inventory together
with measures of cognitive level to examine the student teacher-cooperating teacher
dyad.


Method 

Subjects were 51 student teachers in the Department of Teacher Education at the 
University of Nebraska at Omaha. Thirty-three percent were in their first semester of 
student teaching; the remainder were in their second semester. Males represented 22% of
the sample, females 78%. The mean age was 29.5 years (SD=8.5 years). Fifty-nine
percent of the subjects were teaching at the elementary level, 41% at the secondary
level. The cooperating teachers (N=51) working with the student teachers were experienced
teachers who had previously hosted student teachers.
Instruments

Cognitive level. The cognitive level of student teachers was assessed with several 
scales from the Belief Systems Test, the most recent and experimental set of objec- 
tive, Likert-type scales developed by Harvey and his colleagues (Gore, 1985; Harvey & 
Gore, 1979; Harvey & Hoffmeister, 1971). Cognitive level has traditionally been meas- 
ured by a projective, paragraph-completion task which required trained raters to 
assign a respondent to one conceptual system on the basis of an overall, subjective im- 
pression of the paragraph. Items in each subscale of the experimental Belief Systems 
Test were selected on the basis of both their internal consistency and their ability to 
discriminate prototypes of the four conceptual systems. Eleven scales from the inven- 
tory were selected for use here, because they were shown to be most significant in dis- 
criminating subjects in each of the four conceptual systems (Gore, 1985). It should be 
noted that the discriminant power of these subscales was demonstrated with a sample 
of prototypical (pure) representatives only. Subjects representing admixtures of the 
four systems were not used in the analysis. In addition, though the 11 subscales were 
instrumental in correctly classifying 74% of all prototypical subjects, accuracy rates 
varied per system: that is, the highest accuracy rate was obtained for the lowest system,
and the least accurate results for the highest system. Coefficient alpha for the set
of scales ranged from .61 to .96; and scores on each subscale were shown to correlate 
significantly with results of the subjective, paragraph-completion task used to assess 
cognitive level (Gore, 1985). 
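
The coefficient alpha values reported above summarize the internal consistency of each subscale. As a rough illustration only, the following Python sketch computes alpha from item-level responses. It is not the procedure used by Gore (1985); the function name `cronbach_alpha`, the data layout, and the responses themselves are hypothetical and invented for demonstration.

```python
from statistics import variance


def cronbach_alpha(responses):
    """Return coefficient alpha for one subscale.

    `responses` is an assumed layout: one inner list per respondent,
    holding that respondent's score on each of the k items.
    """
    k = len(responses[0])
    if k < 2 or len(responses) < 2:
        raise ValueError("need at least two items and two respondents")
    # Sample variance of each item across respondents.
    item_variances = [variance([r[i] for r in responses]) for i in range(k)]
    # Sample variance of the summed (total) subscale score.
    total_variance = variance([sum(r) for r in responses])
    return (k / (k - 1)) * (1 - sum(item_variances) / total_variance)


# Invented responses from four respondents on a two-item subscale.
example = [[1, 2], [2, 1], [3, 4], [4, 3]]
print(round(cronbach_alpha(example), 2))  # prints 0.75
```

Alpha approaches 1 when the items rise and fall together across respondents, which is why it is read as an index of internal consistency.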

The 11 subscales and the conceptual systems with which they are associated are 
described below, along with sample items. Five alternative responses were provided 
for each item, ranging from disagree strongly to agree strongly. System 1 scales: 
Effectance Via God: “I believe I will succeed in life if I closely follow God’s teachings.”
Centrality: God: “The greatest loss I could suffer would be my relationship with God.” 
It should be noted that it is not the belief in God that is associated with System 1, but 
the tendency to attribute all success and failure to an external power. Scales associ- 
ated with System 2: Distrust of People: “These days a person doesn’t really know 
whom he/she can count on.” Depression: “I generally hate to face a new day.” Distrust 
of Authority: “Most people in authority and power have gotten there through lying 
and taking advantage of weaker people.” System 3 scales: Sympathy: “I like my 
friends to confide in me and to tell me their troubles.” Personal Effort and Will: “I can 
do anything I want to if I really make up my mind to do so.” Acceptance of Obligation 
(scoring of items is reversed): “I hate to accept a favor because it makes me feel obli- 
gated.” System 4 scales: Rejection of Polarity (scoring is reversed): “All people may be 
classified as good or evil.” Constancy of Change: “There is nothing in the universe 
which is not in the process of constant change.” Rejection of Regular Schedule (scor- 
ing reversed): “I like to have my meals organized and a definite time set aside for eat- 
ing” (Gore, 1985). 

Scores obtained by subjects on each of these subscales were treated as continuous 
variables, that is, scales were not used to dichotomize or classify subjects according to 
any one conceptual system. This procedure was chosen on the assumption that most 
individuals are not prototypical representatives of only one developmental stage, but 
instead probably represent admixtures of several levels (Gore, 1985; Harvey & Gore, 
1979; Santmire, 1984). 

Relationship with cooperating teacher. Student teachers’ perceptions of their re- 
lationships with their cooperating teacher were assessed with subscales from the 
Barrett-Lennard Relationship Inventory, which together appear to assess the overall 
quality of an interrelationship: Level of Regard, Congruence, and Empathy (Lanning 
& Lemons, 1974). Scores on these subscales described how student teachers perceived
their cooperating teachers had responded to them during the course of one semester 
of student teaching. Alpha has been reported above .79 for these scales. 

Evaluation of cooperating teachers. Student teachers’ evaluations of their coop- 
erating teachers were measured with a questionnaire designed for this study. The 
items were based on teachers’ descriptions of the behaviors of consultants who worked 
with them in solving classroom problems (Hansen, Himes, Dowd, & Sartore, 1977). In 
essence, cooperating teachers function as teacher-consultants, assisting student 
teachers to diagnose and resolve classroom problems. According to 140 critical 
incidents described by the teachers in the Hansen et al. study, consultants performed 
five basic functions: listening, questioning, instructing, observing, and structuring. 
Consequently, eight items were developed which asked the respondents (student 
teachers) to rate the ability of their cooperating teachers to: listen carefully to them; 
ask them meaningful questions; teach them skills, attitudes, or behaviors; observe 
their teaching and provide feedback; structure lessons or learning experiences for 
them; suggest alternative ways to solve problems; correctly judge the amount of independence
they should have; and, finally, portray a good role model of a teacher.
A five-point Likert scale was provided, ranging from poor to excellent.

Student teachers’ competency. The competency of student teachers was evalu- 
ated by their cooperating teachers using a 34-item Likert scale routinely used in the 
Department of Teacher Education at the University of Nebraska at Omaha. Items 
asked cooperating teachers to rate a student teacher’s ability to teach, maintain disci- 
pline and rapport with students, as well as the ability to establish a professional de- 
meanor. A five-point Likert scale ranged from poor to excellent. 

Stress. Because how one perceives stress has been positively related to cognitive 
level (Oswald, 1971), it was decided to evaluate student teachers’ inherent levels of 
stress with the Dispositional Stress scales, developed and used by the author in prior 
studies (Albertson & Kagan, in press; Kagan & Albertson, in press). The inventory 
consists of five scales designed to assess personality traits that tend to induce or to 
amplify stress across a variety of situations. Five separate subscores are yielded: 
Overload (a pervasive sense that too much is expected of one), Lack of Self-confi- 
dence, Time Urgency, Need to Keep Busy (the tendency to be stressed by a lack of 
stimulation), and Anxiety. Alpha was reported as ranging between .73 and .88 for the 
set of scales. Scores obtained by teachers on these scales were shown to be strongly re- 
lated to their perceptions of occupational stress, stresses inherent in the teaching 
profession (Albertson & Kagan, in press). It was felt they were likely to be relevant to 
the perceptions and experiences of student teachers as well. 


Procedure 

Student teachers completed the tests of cognitive level and dispositional stress at the 
beginning of the semester. At the end of the semester, they completed the Barrett- 
Lennard scales and the evaluation of their cooperating teacher. Cooperating teachers’ 
evaluations of the student teachers were also completed at the end of the semester. 
Though student teachers and cooperating teachers were providing evaluations of each 
other, they understood that they were assigned special identification numbers to 
assure the confidentiality of their responses. In fact, names never appeared on the
questionnaires, only the identification numbers. All questionnaires were distributed
and returned by mail.
Data Analysis

Items on each subscale were summed for each subject. Responses to items on the 
questionnaire evaluating a cooperating teacher were summed to yield a single global 
rating. A similar procedure was followed for the questionnaire completed by cooperat- 
ing teachers to evaluate the student teachers. 
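
The summing step, together with the reversed scoring noted for some subscales under Instruments, can be sketched in Python as follows. This is a minimal illustration under stated assumptions: the five response alternatives are assumed to be coded 1 to 5 (the source does not give the numeric coding), and the name `score_subscale` and the sample responses are hypothetical.

```python
SCALE_MIN, SCALE_MAX = 1, 5  # assumed numeric coding of the five alternatives


def score_subscale(responses, reversed_items=frozenset()):
    """Sum one subject's item responses, flipping any reverse-scored items.

    `responses` holds the subject's answers in item order;
    `reversed_items` holds the zero-based positions of reverse-scored items.
    """
    total = 0
    for index, value in enumerate(responses):
        if not SCALE_MIN <= value <= SCALE_MAX:
            raise ValueError(f"response {value} is outside the assumed 1-5 range")
        if index in reversed_items:
            # On a 1-5 scale, reversal maps 1 to 5, 2 to 4, and so on.
            value = SCALE_MIN + SCALE_MAX - value
        total += value
    return total


# Invented answers to a three-item subscale whose first item is reverse-scored.
print(score_subscale([5, 4, 2], reversed_items={0}))  # prints 7
```

The same function with no reversed items gives the plain sum used for the global ratings.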

Interrelationships among all variables were assessed via Pearson Product- 
Moment correlation coefficients. In addition, nine separate multiple regression ana- 
lyses were conducted, predicting the score on each item on the questionnaire used to 
evaluate the cooperating teacher, then predicting the global score on the entire ques- 
tionnaire. Predictors in each regression equation were: scores obtained by student 
teachers on measures of cognitive level, dispositional stress, and the relationship 
scales. 
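
For readers unfamiliar with the statistic, the Pearson product-moment coefficient used for the intercorrelations can be sketched in a few lines of Python. This is an illustrative sketch, not the software used in the study; the function name `pearson_r`, the variable names, and the scores are hypothetical and invented for demonstration.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length score lists."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two lists of equal length with at least two scores")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sum of cross-products and sums of squared deviations from the means.
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


# Invented summed scores for four hypothetical subjects on two scales.
empathy = [10, 20, 30, 40]
global_rating = [20, 10, 40, 30]
print(round(pearson_r(empathy, global_rating), 2))  # prints 0.6
```

Squaring a coefficient such as this one gives the proportion of variance the two measures share, which is the quantity the regression analyses extend to several predictors at once.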


Results 

Each of the subscales of the Relationship Inventory was strongly related to student 
teachers’ global assessments of their cooperating teachers (Table 1). The Relationship 
Inventory was also positively correlated with System 1 scales, representing the lowest 
conceptual level (r ranged from .25 to .29), and negatively correlated with System 4 
scales, measures of the highest conceptual level (r ranged from −.26 to −.40). In other
words, student teachers at higher cognitive levels tended to perceive their relation- 
ships with cooperating teachers in less positive terms. Because higher conceptual level 
has been related to an articulated and complex style of information processing (Sieber 
& Lanzetta, 1966), student teachers at higher levels may have perceived their cooper- 
ating teachers more critically or in more complex terms. 

Cooperating teachers’ evaluations of their student teachers were unrelated to any 
part of the Relationship Inventory, but they were negatively related to one of the 
System 4 scales: Rejection of a Regular Schedule (r = −.36). Cooperating teachers
may have associated the predictability and conventionality of a student teacher’s be- 
havior with professionalism. Thus student teachers who tended to adhere to fixed and 
predictable patterns of behavior obtained higher evaluations. 

Bivariate correlation coefficients between measures of stress and teacher ratings 
(not shown) were not significant, except for a significant correlation between student 
teachers’ scores on the Need to Keep Busy scale and the global rating assigned by 
their cooperating teachers: r = −.47. One could infer that the less achievement-oriented
(less active) student teachers were, the more cooperating teachers seemed to
approve of them. This was consistent with cooperating teachers’ approval of consist- 
ency and fixed patterns of behavior among student teachers. 

When separate regression equations were constructed predicting each item on the 
rating scale used by student teachers to evaluate cooperating teachers, results were 
surprisingly consistent (Table 2). In every case, nearly the entire amount of account- 
able variance was contributed by one of the subscales of the Relationship Inventory, 
particularly Empathy or Regard. Multiple R² ranged from .35 to .89, indicating that
on some items student teachers’ evaluations of their cooperating teachers could be 
predicted almost entirely from their responses to the Relationship Inventory. 


Discussion

If one distinguishes between cognitive predictors (scores obtained by student teachers
on the CST measures of cognitive level) and affective predictors (scores on the Relationship
Inventory), then student teachers appeared to define the competency and effectiveness
of cooperating teachers primarily in affective terms. They gave highest
ratings to cooperating teachers whom they perceived to be empathetic and warm.
Student teachers appeared to value cooperating teachers who were friendly, liked
them, and understood their problems. How helpful cooperating teachers were in an
objective, substantive sense may have been relatively unimportant to student
teachers. An even more disturbing inference emerged when it was recalled that scores
on the Relationship Inventory were negatively related to student teachers’ conceptual
level; that is, the aspects of their relationships with cooperating teachers that were
most highly valued by student teachers (Empathy, Regard) were also those associated
with the lowest conceptual levels among student teachers. This indirectly suggested
that the student teaching experience may have been perceived most positively by
student teachers who represented the lowest level of cognitive development (as defined
by CST).

TABLE 1

Intercorrelations Among Teacher Ratings,
Relationship Scales, and Measures of Cognitive Level

Column headings: Student teachers’ global rating of their cooperating teacher;
Cooperating teachers’ global rating of their student teacher;
Relationship Inventory (Level of Regard, Empathy, Congruence)

Findings can be easily related to the model of supervisory behavior suggested by 
Boldt and Housego (1986) and Housego and Grimmett (1983), which distinguishes be- 
tween “facilitative” and “deliberative” supervisors. Facilitative supervisors attach 
greatest priority to the threats student teachers perceive in class experiences. Accord- 
ingly, these supervisors function primarily as kind, considerate, assuring counselors. 
“Deliberative” supervisors are knowledge-based, oriented toward discussing alternative
strategies for dealing with classroom problems. Though this model was based on the 
behaviors of university supervisors, it could logically be extended to cooperating 
teachers, who also serve in a supervisory capacity. 
TABLE 2

Multiple Regression Analyses
(N=51)

Dependent Variable                          R            Predictor                         R²           Beta Weight

Student teachers’ global rating of          .84          Empathy                           .70          .834
  cooperating teacher
Cooperating teachers’ global rating of      .54          Need to Keep Busy (stress)        .30          −.548
  student teacher

Individual items on which student teachers evaluated cooperating teachers:
Cooperating teacher’s ability to . . .

Listen carefully to your comments and                    Congruence                        .14          [illegible]
  problems                                               Empathy                           .79          .630
                                                         Rejection of Polarity (System 4)  .85          .238
                                            .94          Distrust of Authority (System 2)  .89          .194
Ask you meaningful questions about your     [illegible]  Regard                            .59          .780
  feelings, lessons, beliefs, etc.
Teach you skills, attitudes, or behaviors                Regard                            [illegible]  .845
                                            .80          Anxiety (stress)                  .64          [illegible]
Observe your teaching and provide feedback               Empathy                           .46          [illegible]
                                            .76          Anxiety (stress)                  .58          [illegible]
Structure lessons or learning experiences   [illegible]  Regard                            .61          [illegible]
  for you
Suggest alternative ways to solve some                   Empathy                           .35          [illegible]
  teaching-related problems you had                      Anxiety (stress)                  [illegible]  −.309
                                            .78          Effectance via God (System 1)     .61          .333
Correctly judge the amount of freedom and   [illegible]  Empathy                           [illegible]  .643
  independence you should have had in
  planning and teaching
To portray a good role model of a teacher                Empathy                           [illegible]  .893
                                                         Need to Keep Busy (stress)        .85          −.319
                                            .94          Centrality: God (System 1)        .89          [illegible]


It could be inferred that the student teachers surveyed in the present study pre- 
ferred cooperating teachers who were facilitative and supervision that was (in Boldt 
and Housego’s terms) “indulgent” and “impoverished.” More provocatively, results 
also suggested that student teachers’ preferences in terms of supervisory style may 
depend in part on their levels of cognitive development, with more concrete-thinking
student teachers preferring facilitative supervisors. Overall, student teachers in the
present study assigned higher ratings to cooperating teachers whom they perceived as
facilitative. Did these cooperating teachers, in fact, function as facilitative
supervisors? How accurate were student teachers’ perceptions of their cooperating 
teachers’ performance and attitudes? 

One might cite the lack of significant correlations between cooperating teachers’ 
evaluations of student teachers and student teachers’ perceptions of the interpersonal 
relationships as evidence that student teachers misinterpreted the feelings of their co- 
operating teachers. In this regard, results forecast a series of future studies examining 
the mutual perceptions of student teachers and cooperating teachers. Is accuracy of 
perception associated with student teachers’ levels of cognitive development? Results 
also forecast a series of “matching” studies in which dyads of cooperating teachers and 
student teachers matched for cognitive level might be compared with mismatched 
dyads. Do dyads matched for cognitive level perceive one another more accurately or 
regard one another more positively than do mismatched dyads? Handley (1982) 
found this was partially true for counselors and trainees matched for cognitive style, 
as measured by the Myers-Briggs scales.

It might also prove useful to pursue inferences of another finding presented here: 
Cooperating teachers assigned higher ratings to student teachers who, in part, resem- 
bled prototypes of System 1, the lowest conceptual level. This suggests that cooperat- 
ing teachers approved most of student teachers who represented the most concrete, 
simplistic thinkers. Cooperating teachers also assigned higher ratings to student 
teachers who obtained relatively low scores on the Need to Keep Busy (stress) scale. 
Items on this scale assess the tendency to be type-A: ambitious and achievement 
oriented. Assuming results are generalizable to populations beyond this study, in 
what other ways might we be discouraging achievement-orientation and superior cog- 
nitive performance among preservice teachers? Would university supervisors’ evalua- 
tions of student teachers be similar to those of cooperating teachers in rewarding 
student teachers at the lowest conceptual levels? 

The conceptual level at which student teachers and their cooperating teachers
function is an issue hitherto ignored in empirical literature. Conceptual Systems 
Theory represents but one of the developmental models of cognitive development 
that might be applied to teacher training. Considering the pervasive influence one’s 
level of cognitive development most likely has in terms of how one perceives and eval- 
uates situations and people, it seems a crucial variable to examine. We have long 
acknowledged in other content areas that the differential aptitudes of students de- 
mand differential kinds of instruction. It is reasonable to assume the same would be 
true of teacher training. 


Limitations 

Inferences should be regarded as hypothetical and tentative in light of several limita- 
tions inherent in this study. First, cognitive level must be understood strictly within 
the definition of Conceptual Systems Theory. The Likert-type subscales used to 
measure conceptual level are still experimental in nature. To what degree the scales 
assess something more than attitude or personality—that is, level of cognitive
development—may constitute a debatable issue. In addition, data were correlational in
nature, and thus cannot be construed as evidence of direct causal relationships
between variables.

We were unable to statistically compare correlation matrices for first versus second
semester student teachers, or for males versus females, due to the ratio of subjects
within these subgroups to the number of variables. Thus the conclusions drawn here 
may not be generalizable across all these subgroups. It would be interesting, for ex- 
ample, to compare student teachers completing their first versus their second semes- 
ter of student teaching. Do student teachers in both groups prefer facilitative rather 
than deliberative cooperating teachers? O’Neill (1985) presented evidence strongly 
suggesting that results obtained for a sample of female student teachers are likely to 
be significantly different from those obtained from male student teachers. O’Neill 
cited two reasons to expect gender-based differences. First, supervisors tend to de- 
value the performance of same-sex student teachers. Second, the perceptions of fe- 
male student teachers are more likely to be consistent with the ways cooperating 
teachers actually evaluate them, because females are more socialized to feedback. Fe- 
male student teachers would be more sensitive and responsive to evaluative feedback 
from cooperating teachers, so feedback would be more likely to affect behavior. 


Assessments by Teachers of their
Preservice Preparation Programs 


Graduates of Alberta teacher education programs were surveyed to determine the ex- 
tent to which their preservice preparation provided them with the knowledge and skills 
required for effective teaching. Teachers viewed programs as having more potential to 
develop knowledge and understandings than skills. Preparation programs were rated 
as most effective in the areas in which teachers considered them to have the highest 
potential; however, these areas were not highly important to teachers in their daily 
work. Practicums were judged to be the most significant component of teacher prepara- 
tion. The significance attached to other education components was related directly to 
the degree to which they were judged to be practical. Results of the study raise, once 
again, issues related to the general orientation of preservice teacher education pro- 
grams. 


Research which attempts to strengthen the knowledge base for teacher education 
takes a variety of different forms. Some studies focus on selected components of 
preservice preparation or on the experiences of particular individuals while others at- 
tempt global assessments of programs. The project on which this report is based falls 
into the latter category. More specifically, an evaluative study of teacher preparation 
programs in Alberta was carried out using a follow-up approach. Although the research
relates to a specific context, both the method and the results may be of more
general interest.


Dr. Miklos is a professor of educational administration with interests in research methods and development in organization 
theory. 

Dr. Greene is Director of the Education Research, Evaluation and Development Centre and is currently involved in a number of 
research and evaluation projects on teaching and teacher education. 


Problem and Purpose

The major problem addressed by the study was the limited amount of current infor- 
mation available to decision makers about the strengths and shortcomings of teacher 
preparation programs offered by Alberta universities. In view of this consideration, a 
project was designed to address the following question: To what extent do preservice 
preparation programs provide teachers with the knowledge and skills which are im- 
portant for effective teaching? Although answers to the question could be sought in 
various ways, teachers who had graduated recently from programs were assumed to be 
a valid source of information. 

A number of more specific questions served as guides to the development of the 
study: 
1. How important are selected knowledge, skills, and understandings to teachers in
   their current positions?

2. What potential do teachers see for preservice programs to develop those
   knowledge, skills, and understandings?

3. How effective are preservice preparation programs in helping teachers to acquire
   the requisite knowledge and to develop important skills?

4. What suggestions do teachers have for improving preservice preparation
   programs?

Within the context of these questions, attention was directed also to the contributions
made by different components of preservice programs to teacher preparation as well
as to variations in perceptions of teachers who had pursued different routes. The
major method of data collection was a questionnaire survey supplemented by
interviews.


Background 

In spite of the research which has been conducted, efforts to improve teacher educa- 
tion programs have been hampered by insufficient information about the adequacy of 
existing practices. A review of studies that could be classed as evaluations of teacher 
preparation programs revealed that not much has been reported in this area
(Hawryluk & Greene, 1986). Few Canadian universities have initiated comprehensive, 
longitudinal, planned evaluations grounded in a conceptual model of what preservice 
programs attempt to accomplish or what contribution they make to the development 
of particular knowledge and skills. However, there have been numerous studies of 
programs which are specific to particular geographic areas or institutions (Greene, 
1979; Haines & Robertson, 1985; McCutcheon, 1979; Ratsoy, McEwen & Caldwell, 
1979; Rieger & Woods, 1971; Rieger, 1974). Predominant among these are follow-up 
studies of graduates. Such studies have been criticized for their conceptual and 
methodological inadequacies, for the lack of representative sampling, for the limited 
use to which the results are put by the institutions which sponsor them, and for their 
problematic generalizability (Katz, Raths, Mohanty, Kurachi, & Irving, 1981). Al- 
though these limitations cannot be denied, research of this nature does have potential 
value. 

Studies which have been conducted in Canada indicate that some preservice pro- 
grams tend to have a strong academic orientation (Ratsoy, McEwen & Caldwell, 
1979). With respect to particular components, the practicum consistently emerges as 
the experience which is regarded as most valuable by teachers (Wideen & Holborn, 
1986). Of the other contributions made by the faculty of education, curriculum and
instruction courses are generally assessed more favorably than are courses in other
areas (Rieger & Woods, 1971). Programs have been criticized by graduates for an
insufficient emphasis on skill development in areas such as classroom management,
communication, diagnosing learning difficulties, and specific curriculum develop- 
ment and implementation skills (Haines & Robertson, 1985; Rieger, 1974). More 
major concerns relate to the general fragmentation of the program and to the gap be- 
tween theory and practice (Ratsoy, McEwen & Caldwell, 1979; Wideen & Holborn, 
1986). 

In order to be useful for decision making, results of such studies must be validated 
for specific programs. Validations are also replications which may contribute to the 
development of general knowledge about teacher education. For those reasons, 
follow-up studies seem to have a legitimate place in research on teacher education. 


Method 

Because one of the objectives was to ground the study of preservice programs in re- 
search on teacher effectiveness, the relevant literature was reviewed in the search for 
potential questionnaire items (see Hawryluk, 1986). An additional thrust was to re- 
view goals of teacher education programs in order to identify items that appeared to 
have potential for assessing preservice programs (see Hawryluk & Greene, 1986). 
Particular attention was given to relevant documents—goal statements, program and 
course descriptions—of Alberta teacher education programs to identify explicit or im- 
plicit objectives. Questionnaires which had been used in similar studies—both in gen- 
eral follow-up studies and in program evaluations—were also reviewed. 

The preliminary work indicated a need for a conceptual framework which would 
provide the basis for developing a sufficient number of valid items. Through discus- 
sions of the results of the literature review with a number of teacher educators, a cate- 
gorization of nine teacher roles—for which Lanier (1984) served as a point of 
departure—was developed. The conceptualization identified nine clusters of activities
or teacher roles. More specifically, teachers were viewed as effective persons, as 
skilled practitioners, as decision makers, as creators of learning environments, as par- 
ticipants in establishing human relationships, as learners, as specialists in the content 
and process of learning, as participants in curricular and educational change, and as 
professional educators. 

A number of teacher educators used the categorization of roles to develop knowl- 
edge, skill and understanding items and critiqued each other’s suggestions in several 
sessions. Eventually, more than 60 potential items were identified. Selection of items 
for inclusion in the questionnaire was based on the dual criteria of the appropriate- 
ness rating by a sample of teacher educators and representativeness across the nine 
categories of teacher roles. Through this process, 40 items were tentatively selected 
and reduced to 30 after a series of pilot tests of draft questionnaires.


Design of the Questionnaire 
The final form of the questionnaire consisted of six parts as described below: 

Part A. Potential Contribution of Preservice Preparation—30 items with one 
response for each. The response scale ranged from P where preservice 
preparation is the main contributor to E where experience is the major 
contributor through three combinations of preparation and experience. 

Part B. Effectiveness of Preparation Programs—30 items with two responses 
(importance and effectiveness) for each. On importance the scale ranged 
from 5 (High Importance) to 1 (Low Importance). The program effective- 
ness scale ranged from 5 (High Effectiveness) to 1 (Low Effectiveness). 

Part C. Components of Preservice Preparation—seven questions with a rating
scale and space for comments. Seven components were defined which reflected
groups or clusters of courses in each of the programs: courses
taken outside of the Faculty of Education; curriculum and instruction
courses; courses in educational psychology; courses in history, philoso- 
phy, or sociology of education; courses in the organization and adminis- 
tration of education; practicum experiences; and general interest courses. 
A scale ranging from 7 (Highly Significant) to 1 (Of Limited Significance) 
was used. Space was provided in the questionnaire for comments on each 
component. 


Part D. Strengths and Shortcomings of Preparation Programs—three open- 
ended questions on the strengths, shortcomings, and desired changes in 
preservice preparation programs were also included. 


Part E. Background Information—11 questions with varying response categor- 
ies relating to preparation program characteristics and information on 
the respondent’s current teaching assignment which was intended for use 
in describing the sample and analyzing the data. 


In Part F respondents were asked if they would be willing to participate in an in-person
interview. A separate response form was enclosed.


Data Collection Procedures 

The purposes of the study called for the selection of a sample from among teachers 
who were graduates of Alberta teacher preparation programs and who were in their 
first, third, or fifth year of teaching. Access to the population was through the records 
of certificated teaching personnel in Alberta maintained by the Department of Edu- 
cation. In an effort to obtain as large a sample as possible, a decision was made to 
survey all first-year teachers and to sample equal numbers of teachers in their third 
and fifth years of teaching. 

The packet, which was mailed to 1292 teachers in February 1986, included the following
items: a covering letter, a copy of the questionnaire, a form on which teachers
could indicate a willingness to participate in an interview, and a postage-paid return 
envelope. A follow-up letter was sent about two weeks later. In total 523 completed 
questionnaires were received of which 515 were used in the analysis. A summary of the 
return rate by year of teaching is shown in Table 1. 

As is indicated in the table, the overall response rate was about 40%. The percent- 
age of responses for teachers in their fifth year of teaching was higher than that for 
teachers in their first or third year. However, the responses of first-year teachers rep- 
resented a higher percentage of the population than did third- or fifth-year teachers. 
Any generalizations from the results will clearly be biased in the direction of the 
responses of first-year teachers. Furthermore, because the majority of the responses 
were from graduates of the University of Alberta Faculty of Education, the results of 
the analysis reflect responses relevant to that particular institution to a dispropor- 
tionate degree. 
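
The return percentages discussed above can be checked directly from the counts in Table 1. The following is a minimal sketch of that arithmetic; the names `TABLE_1`, `rates`, and `summarize` are illustrative only and are not part of the study.

```python
# Counts transcribed from Table 1:
# year of teaching -> (number in population, number in sample, number returned).
TABLE_1 = {
    "First": (560, 552, 213),
    "Third": (595, 370, 141),
    "Fifth": (855, 370, 169),
}


def rates(population, sample, returned):
    """Return (percent return, percent of population), rounded to one decimal."""
    return (round(100 * returned / sample, 1),
            round(100 * returned / population, 1))


def summarize(table):
    """Compute both percentages for each row and for the column totals."""
    rows = {year: rates(*counts) for year, counts in table.items()}
    totals = tuple(sum(column) for column in zip(*table.values()))
    rows["Total"] = rates(*totals)
    return rows


if __name__ == "__main__":
    for year, (pct_return, pct_population) in summarize(TABLE_1).items():
        print(f"{year:<6} return {pct_return:5.1f}%   "
              f"of population {pct_population:5.1f}%")
```

The sketch makes the direction of the bias visible: first-year respondents represent roughly 38% of their population, whereas fifth-year respondents represent about 20% of theirs.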

In total, 224 respondents completed the form on which they indicated a willing- 
ness to participate in interviews. The research plans called for a maximum of 45 
interviews; 42 teachers were actually interviewed during the months of May and June 
1986. The interviews were conducted with teachers individually, either at the school, 
at their homes, or in an office setting. Duration of the interviews varied but tended to
be about one hour. Although a number of interviewers were involved, a common guide
was used.




TABLE 1

Distribution of Population, Sample and Respondents
by Number of Years of Teaching Experience

Year of      Number in     Number in    Number      Percent    Percent of
Teaching     Population    Sample       Returned    Return     Population

First           560           552         213        38.6        38.0
Third           595           370         141        38.1        23.7
Fifth           855           370         169        45.7        19.8
Total          2010          1292         523        40.5        26.0


Data Analysis


All questionnaire data, including the responses to open-ended questions, were en- 
tered into a computer file. The primary mode of analysis of quantitative data was to 
compute frequencies, percentages, and means. Statistical tests were applied to estab- 
lish whether differences or contingencies merited discussion; however, the outcomes 
of the tests were regarded mainly as guides rather than as a definitive basis for inter- 
preting the results of the survey. 

An intensive content analysis of responses to open ended questions was carried 
out. Some of the results of this aspect of the study have been reported by Greene and 
Miklos (in press). Interviews were also subjected to appropriate analysis, and a com- 
prehensive summary of 15 of the interviews was prepared by Chapman (1986). 


Results 

The results of the data analysis are reported primarily in relation to the 30 knowledge/ 
skills/understandings items included in the questionnaire. The analysis revealed the 
extent to which various items were important to teachers, the extent to which teachers 
considered preservice programs to have potential for developing requisite knowledge/ 
skills/understandings, and the effectiveness of programs in preparing teachers in 
those areas. In addition, the contributions made by various components as well as the 
strengths and shortcomings of programs were revealed both by the questionnaire and 
the interview data. 


Importance 

In analyzing the data on importance of selected knowledge, skills, and understand- 
ings, the five response categories were collapsed into three: very high importance, high 
importance, and moderate importance. All the specific items included in the ques- 
tionnaire were at least of moderate importance to teachers with most of them approx- 
imating high importance. Consequently, the knowledge and skills identified can be 
regarded as common to a variety of teaching situations. The five highest ranking items 
and the five lowest ranking items are displayed in Table 2. 

Rated highest in importance by teachers were skills relating directly to the opera- 
tion of a classroom such as organizing for teaching, motivating, and communicating 
with students, and learning from experience. Of less importance, but still important, 
were understanding the organizational structure of the educational system, partici- 
pating in the process of improving schools, and understanding social issues or legal 
aspects of teaching. 
TABLE 2

Distribution of Responses on Knowledge, Skills, and
Understandings Rated Highest and Lowest in Importance (N=515)

                                                                Importance
                                                                Very                        Moderately
Knowledge/Skill/Understanding                                   High          High          High

High Importance Items
Organizing myself for teaching                                  83.9%         15.0%         1.1%
Motivating students and involving them in learning              81.3          16.7          2.0
Communicating effectively with students                         80.3          18.1          1.6
Relating to students in a supportive way                        67.0          27.8          [illegible]
Learning from my own professional experience                    [illegible]   26.6          6.2

Low Importance Items
Using community resources to support teaching and learning      22.4          39.4          38.2
Understanding social issues which have relevance for education  21.8          36.3          41.9
Understanding legal aspects of my work as a teacher             22.3          33.6          44.1
Participating in the process of improving schools               18.7          37.4          43.9
Understanding the organizational structure                      10.5          25.4          64.1
  of the educational system



There were marginal differences between different categories of teachers in the 
degree of importance attached to various areas of knowledge, skills, and understand- 
ings. In general, teachers in the elementary grades attached more importance to the 
pedagogical aspects of teaching than did teachers who taught in secondary grades. 
Teachers with a secondary grade level orientation placed more emphasis on the im- 
portance of a broad general education and knowledge of subject matter. Teachers in 
their first year of teaching attached more importance than did teachers in their third 
or fifth year of teaching to learning from their professional experience, managing the 
routine activities of a classroom, and making a commitment to their professional 
growth. 


Potential of Programs 

Respondents also rated the 30 items in terms of the relative contribution made by 
preservice teacher education programs and experience to the development of the 
knowledge, skill, or understanding. During the data analysis, the five response cate- 
gories were collapsed to three: preservice programs make a greater contribution than 
experience, contribution is about equal, and experience makes a greater contribution. 
As might be expected, responses ranged across the three categories for each item; 
nevertheless, there was a marked gradation from item to item in the distribution of 
responses. The seven items on which more than one-half of the respondents believed 
that preparation was a greater source of learning than experience, and the six items 
for which less than 10% regarded preservice preparation to be the major source of 
learning, are displayed in Table 3. 

TABLE 3

Distribution of Responses on Knowledge, Skills and
Understandings to which Preparation Programs Make
a Major or Limited Contribution (N=515)

                                                                Source of Learning
                                                                P or                        E or
Knowledge/Skill/Understanding                                   P/E           P=E           E/P

Major Contribution
Understanding one or more subject areas in depth                [illegible]   [illegible]   [illegible]
Knowing the ethical standards of the teaching profession        66.8          [illegible]   [illegible]
Having a broad general education                                66.2          22.1          [illegible]
Knowing how children develop and learn                          65.6          24.5          9.9
Understanding the legal aspects of one’s work as a teacher      60.2          [illegible]   [illegible]
Having an effective command of the language of instruction      57.3          22.0          20.6
Understanding the organizational structure                      52.9          25.9          21.2
  of the educational system

Limited Contribution
Helping students acquire a love of learning                     8.6           31.2          60.2
Managing the routine activities of a classroom                  9.9           21.5          68.6
Relating to students in a supportive way                        6.8           24.4          68.8
Communicating effectively with parents                          6.0           16.8          [illegible]
Relating effectively to other teachers                          [illegible]   15.3          79.0
Learning from one’s own professional experience                 [illegible]   11.4          85.4


An examination of these items serves to indicate the general nature of the potential
which teachers see for preservice preparation programs. Six of the seven highest
ranking items refer directly to knowing or understanding something: subject areas,
ethical and legal standards, general knowledge, how children learn, and the structure
of education. Only one of the items refers to a characteristic that might be considered
as more skill related. In contrast, all six items on which preparation programs are
viewed as having minimal potential involve the teacher being able to do something or
possessing a skill: helping students acquire a love of learning, managing routine
activities, and relating to others. In these areas of being able to do something, teachers
identify experience as the major source of learning. Particularly noteworthy is the
limited potential which teachers attribute to teacher preparation programs for
developing skill in interpersonal relationships.

A general indication of the potential which teachers see in preservice preparation 
may be gained by disregarding the P=E percentages and comparing the first and 
third columns across all 30 items. The results of this comparison indicate that 
preservice preparation is regarded as making the greater contribution by a higher pro- 
portion of respondents on nine items; two more are approximately equivalent. For the 
remaining 19 items, the greater proportion of respondents consider experience to be 
the main source of learning. These comparisons suggest that, in relation to the types 
of items included in the questionnaire, teachers see relatively limited potential for 
preservice programs to make a major contribution to their learning. 
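
The column comparison described above can be illustrated with a short sketch. It uses only those rows of Table 3 in which all three percentages are legible, so it does not reproduce the full 30-item tally. The 5-point tolerance used to call two columns "about equal" is a hypothetical threshold chosen for illustration; the study does not report the criterion it applied.

```python
# (P/E, P=E, E/P) percentages for Table 3 rows with three legible cells.
ROWS = {
    "Knowing how children develop and learn": (65.6, 24.5, 9.9),
    "Having an effective command of the language of instruction": (57.3, 22.0, 20.6),
    "Helping students acquire a love of learning": (8.6, 31.2, 60.2),
    "Managing the routine activities of a classroom": (9.9, 21.5, 68.6),
    "Relating to students in a supportive way": (6.8, 24.4, 68.8),
}

# Hypothetical threshold (percentage points); not taken from the study.
TOLERANCE = 5.0


def dominant_source(prep, equal, exper, tolerance=TOLERANCE):
    """Compare the first and third columns, disregarding the P=E column."""
    if abs(prep - exper) <= tolerance:
        return "about equal"
    return "preparation" if prep > exper else "experience"


def tally(rows):
    """Count how many items fall under each dominant source of learning."""
    counts = {"preparation": 0, "about equal": 0, "experience": 0}
    for cells in rows.values():
        counts[dominant_source(*cells)] += 1
    return counts


if __name__ == "__main__":
    for item, cells in ROWS.items():
        print(f"{dominant_source(*cells):<12} {item}")
    print(tally(ROWS))
```

Applied to all 30 items, a tally of this kind would yield the distribution reported in the text: nine items favoring preparation, two approximately equivalent, and 19 favoring experience.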


Program Effectiveness

Responses on program effectiveness ratings were collapsed to form high, moderate, 
and low categories. The proportion of respondents who rated their programs as highly 
effective on specific items ranged from 66.8% to less than 9.1%. Five items were rated 
as highly effective by more than 50% of the respondents and four items were rated 
low in effectiveness by more than 50% of the respondents. These items are displayed 
in Table 4. 

An inspection of the table reveals that programs are most effective in developing a 
range of knowledge items—knowledge of how children learn, of ethical standards, and 
of subject areas. Only one item which may be more skill-related—using a variety of 
methods—appears in the high effectiveness cluster. The four items which programs 
handle least effectively in the opinions of teachers are assessing social and emotional 
needs of students, participating in school improvement, and relating to other teachers 
or to parents. 

In general, preservice preparation programs were viewed as highly effective on rel- 
atively few items of knowledge, skills, and understandings. Although there was a 
range across areas and across responses, the ratings suggest that programs were only 
moderately effective in the views of many teachers. Preservice preparation programs 
appear to be more effective in developing knowledge and understandings than in de- 
veloping teaching skills. For example, programs were rated as more effective on teach- 
ing about how children develop and learn than they were on teaching how to diagnose 
student learning difficulties. 

Variations in ratings of effectiveness reflected differences in preservice prepara- 
tion programs pursued by teachers and in their current teaching situations. In gen- 
eral, teachers with an elementary background rated their programs as marginally 
more effective on pedagogical aspects than did teachers with a secondary teaching 
orientation. 

Although there were relatively few differences across experience categories, 
teachers who were in their first year of teaching tended to rate certain aspects of the 
program more favorably than did those who were in their third or fifth year of teach- 
ing. Among the items on which first-year teachers were more positive about their 
preparation were the following: assessing social-emotional needs of students, evaluat- 
ing students, understanding the organization of the educational system, understand- 
ing legal aspects of education, knowing the ethical standards of the teaching 
profession, developing a personal philosophy, and making a commitment to continu- 
ing professional growth. 

The reasons for the differences between teachers in their first year and those in
their third or fifth year are difficult to identify. However, comments made by third-
and fifth-year teachers about certain aspects of their programs suggested that at least 
some of the differences may be due to recent changes in preservice programs. 


Program Potential, Effectiveness, and Importance of Knowledge and Skills

An indication of the general relationship among program effectiveness, program 
potential, and importance of items can be inferred from Table 5. The table shows the 
items on which the mean response scores on effectiveness ranked from first to 15th. In 
the second and third columns are displayed the rankings of the mean response scores 
of the same items on program potential and importance to teachers. A comparison of 
the rankings of the same item across the three columns indicates how the effective- 
ness of preservice programs relates to judged potential and importance of particular 
knowledge, skills, and understandings. 
TABLE 4

Distribution of Responses on Knowledge, Skills and
Understandings on which Programs Rate Relatively
High or Relatively Low in Effectiveness (N=515)

                                                                Effectiveness
Knowledge/Skill/Understanding                                   High          Moderate      Low

High Effectiveness
Knowing how children develop and learn                          66.8%         25.2%         8.1%
Knowing the ethical standards of the teaching profession        52.8          [illegible]   14.9
Having a broad general education                                53.6          32.6          13.8
Using a variety of methods of instruction                       51.8          34.6          13.6
Understanding one or more subject areas in depth                51.9          27.6          20.5

Low Effectiveness
Assessing the social and emotional needs of students            14.7          32.8          52.5
Participating in the process of improving schools               [illegible]   [illegible]   59.6
Relating effectively to other teachers                          12.1          29.5          58.4
Communicating effectively with parents                          11.6          20.2          68.2


TABLE 5

Ranking of 15 Knowledge, Skill and Understanding Items
on Program Effectiveness, Potential of Preservice
Programs, and Importance to Teachers

                                Ranking
                                                Program
Knowledge/Skill/Understanding   Effectiveness   Potential       Importance

Knowing how children learn      1               4               22
Knowing ethical standards       2               2               25
Having a general education      3               3               21
Using a variety of methods      4               8               [illegible]
Understanding subject areas     5               [illegible]     23
Having command of language      6               6               7
Understanding legal aspects     7               5               28
Understanding organization      8               7               30
Having personal philosophy      9               [illegible]     [illegible]
Planning daily and long-term    10              11              8
Selecting materials             11              12              11
Making growth commitment        12              [illegible]     18
Motivating students             13              14              2
Understanding social issues     14              10              [illegible]
Making classroom stimulating    15              19              [illegible]

The pattern which emerges from these comparisons suggests that preservice preparation
programs tend to be judged most effective in areas of knowledge, skills, and
understandings in which teachers expect them to make a substantial contribution. 
Included among these are knowing how children develop, knowing the ethical 
standards of the profession, having a broad general education, understanding subject 
matter, and understanding legal aspects of teaching. With few exceptions, the areas in 
which preparation programs are most effective tend to be of only moderate impor- 
tance to teachers in their daily work. Among the exceptions are using a variety of 
methods, having an effective command of the language of instruction, and planning 
on a daily or long-term basis. 

In contrast, preparation programs are only moderately effective or low in effec- 
tiveness in a number of areas which are highly important to teachers. Generally, these 
tend to be skills that they require in the day-to-day conduct of their classrooms such 
as organizing for teaching. For this item, the mean response score ranked 17th on ef- 
fectiveness and 16th on potential but first on importance. Programs were also rated 
low in effectiveness in helping teachers learn how to communicate with, motivate, and 
evaluate students. Teachers attributed relatively low potential for preservice
programs to help them develop these skills.
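
The mismatch between effectiveness and importance can be expressed as a simple difference in ranks. The sketch below is illustrative only: it uses the rows of Table 5 in which both rankings are legible, together with the ranks quoted in the text for organizing for teaching. The "shortfall" measure (effectiveness rank minus importance rank) is an assumption introduced here and is not a statistic reported by the study.

```python
# (effectiveness rank, importance rank) among the 30 items; 1 is the highest rank.
RANKS = {
    "Knowing how children learn": (1, 22),
    "Having a general education": (3, 21),
    "Having command of language": (6, 7),
    "Understanding organization": (8, 30),
    "Planning daily and long-term": (10, 8),
    "Motivating students": (13, 2),
    # Ranks quoted in the text rather than listed in Table 5.
    "Organizing for teaching": (17, 1),
}


def rank_shortfalls(ranks):
    """Sort items by (effectiveness rank - importance rank), largest first.

    A large positive value marks an item that teachers rank as important
    but on which programs rank comparatively low in effectiveness.
    """
    gaps = [(item, eff - imp) for item, (eff, imp) in ranks.items()]
    return sorted(gaps, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    for item, gap in rank_shortfalls(RANKS):
        print(f"{gap:+4d}  {item}")
```

On these rows, classroom skills such as organizing for teaching and motivating students show the largest positive shortfalls, while knowledge items such as understanding the organization of the educational system show the largest negative ones.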


Contributions of Components 

In Part C of the questionnaire, teachers rated the significance of the contribution 
made by seven components of the preservice program to the development of knowl- 
edge, skills, and understandings which are important to their work. Space was also 
provided for comments. The distributions of the responses across the scale are pre- 
sented in Table 6. Although there are some clear trends, every component was rated 
as either high or low in significance by some teachers. 

Practicum experiences were rated as the most important component of preservice 
preparation programs by a substantial margin over any other component. Approxi- 
mately two-thirds of the respondents gave it the highest possible rating. Comments 
indicated that practicum experiences were rated positively because they are practical, 
provide direct contact with teaching, are clearly relevant to the work for which 
teachers are preparing themselves, and initiate the process through which teachers 
become socialized to the profession of teaching. 

Noneducation courses and general interest courses were rated more favorably as 
components of preservice preparation than were courses offered in the Faculty of 
Education. These courses were viewed positively because they provide a general back- 
ground for teaching and knowledge of the area of subject specialization. Of the educa- 
tion course component, curriculum and instruction and educational psychology 
courses were rated as being more significant to teacher preparation than were educa- 
tional administration and educational foundations courses. 

Curriculum and instruction courses are expected to provide experiences which are 
related directly to instructional tasks in the classroom. Teachers who experienced 
courses which they viewed as practical rated them more favorably than did those who 
regarded them as theoretical or idealistic. Those courses in which professors used in- 
structional methods which were being advocated were rated more favorably than were 
those in which there was a discrepancy between “theory” and actual practice. 

TABLE 6

Distribution of Responses on Significance Ratings
of Components of Preservice Preparation (N=515)

                              Significance Rating
Component                     7             6             5             4             3             2&1

Practicum experiences         67.0%         17.6%         7.3%          4.6%          2.5%          1.0%
Noneducation courses          24.0          20.8          19.0          [illegible]   8.8           15.8
General interest courses      13.7          21.9          28.0          19.1          9.1           8.2
Curriculum and instruction    14.5          20.4          24.1          20.0          9.8           [illegible]
Educational psychology        [illegible]   [illegible]   [illegible]   [illegible]   10.6          [illegible]
Educational administration    6.5           13.3          18.2          [illegible]   16.2          18.6
Educational foundations       2.7           [illegible]   8.8           16.5          20.7          46.1


Educational psychology courses were viewed favorably because they contribute to
developing understandings of students and of learning styles. Favorable assessments
increased when application to the classroom context was demonstrated by the
instructor.

The majority of respondents did not see the relevance of educational foundations 
courses for preparation as a teacher. Relatively infrequent favorable assessments were 
related to the contribution which courses make to general knowledge and to personal 
development. A substantial proportion of respondents regarded educational adminis- 
tration courses as relevant and practical; however, overall assessments were reduced 
for a variety of reasons related primarily to the content of the courses. 


Strengths, Weaknesses, and Improvements 

The strengths and weaknesses of preservice preparation programs were addressed 
both through open-ended questions in the questionnaire and through the interviews. 
Consistent with the responses indicated above, the major strengths of preservice 
preparation programs identified by teachers were related to practicum experiences; 
curriculum and instruction courses followed in second place. There was considerable 
diversity in views of teachers about other strengths of the programs. 

Practicums were described as a strength because prospective teachers appreciated 
the opportunity to have “hands on” experiences, to acquire a store of ideas about 
teaching, and to develop confidence in themselves as teachers. In the views of 
teachers, practicums could be improved through better selection and preparation of 
cooperating teachers, better monitoring of the practicum placements, and greater in- 
volvement of faculty consultants or associates in the practicum. The major suggestion 
offered by teachers for improving preservice preparation was to provide more practi- 
cum experiences and to begin them earlier. 

Curriculum and instruction courses were identified as a strength because prospec- 
tive teachers learned those things which they could apply in the practicums and later 
in their own classrooms. Suggestions for improvement tended to focus on an even 
greater applied orientation in the content as well as on a closer integration of theory 
and practice. 

Those students who experienced good instruction regarded the quality of their 
professors as a strength of the program. Good instructors are those who know how to 
teach and know what teaching in the schools is like. They are able to help students 
prepare themselves for teaching by serving as role models and by exemplifying sound 
instructional practices. On the other hand, professors who are “out of touch with the
reality of the schools” were perceived to be a weakness in the program.

In general, the major shortcomings of preparation programs are the heavy empha- 
sis on what teachers described as theory and the limited opportunities which are pro- 
vided for developing the skills required in teaching. Teachers reported that when they 
began to teach they encountered numerous problems or unknowns. The substance of 
university preparation seemed to be far removed from the realities of classroom 
teaching, according to numerous participants in the study. Teachers would improve 
programs by injecting more of a “how to do” emphasis into preservice preparation. 

Shortcomings were also perceived in the general organization of the program: 
courses were not sequenced appropriately, practicum experiences were not placed at 
the optimum point, assignments were time consuming and trivial, integration of con- 
tent within and across components was lacking, and some mandatory courses were of 
questionable value. To a number of teachers, the logic behind the structure was not 
evident; completing a teacher education program was little more than taking a re- 
quired number of discrete courses. 

Some teachers proposed higher standards for admission and greater selectivity of 
candidates for teaching as a way to improve programs. A few proposed a major pro- 
gram redesign; however, there was no evident consensus on what major changes 
should be introduced. The majority of teachers were supportive of addressing specific 
areas in which they saw possibilities for improvement. 


Discussion 

The similarities between the results of this study and of previous research serve to re- 
mind teacher educators of numerous long-standing issues and problems which must 
be addressed in the search for improved teacher preparation. Among these challenges 
are clarifying the philosophies on which preservice programs should be based, decid- 
ing on the extent to which teacher preparation should be oriented toward practice, 
achieving the integration of theory and practice, improving the nature and quality of 
teaching in preservice programs, and placing reforms in the changing contexts of 
teacher education. Relating the specific outcomes of the study to summaries of issues 
(Hrynyk, 1987) or of research (Wideen & Holborn, 1986) would be productive but is 
beyond the scope of this report. Instead, this discussion will focus specifically on the 
central question: To what extent do preservice preparation programs provide teachers 
with the knowledge and skills which are important for effective teaching? 

From the perspectives of teachers, preservice programs make only a limited con- 
tribution to the development of skills which are essential for effective performance in 
classrooms. Although preparation programs contribute to the development of a broad 
range of knowledge and skills, the greatest contributions are in areas that do not relate 
directly to what teachers consider to be of greatest significance to them in their
work. Teachers appear to expect to learn most of these skills through experience; on 
the other hand, they do expect to acquire knowledge of subject matter and of various 
aspects of teaching through preservice preparation. In general, the latter expectation 
is fulfilled to a higher degree than is the expectation to learn about practical aspects of 
teaching. 

The results are consistent with those of other studies which suggest that teachers 
feel inadequately prepared to face the classroom on the basis of their preservice prep- 
aration (Wideen & Holborn, 1986). On the other hand, teachers do not seem to be in- 
tensely negative about their preparation because most of them probably entered 
preservice programs with relatively modest expectations. Nevertheless, in the views of 
teachers, preservice programs could do much more to help prepare them in areas that 
are important to effective teaching.

The extent to which preservice programs should and can be made more practical is
a significant issue for teacher educators. One position holds that preservice programs 
should provide the broad knowledge base on which effective practice can be built 
without becoming immersed in technical matters (Clifton, 1985; Miller & Taylor, 
1984). Others, particularly students (Nixon & Bumbarger, 1984), might hold that the 
need for developing teaching skills or pedagogical insights (Bruneau, 1985) cannot be 
ignored. Still others (O’Neill, 1986) raise the question as to whether universities offer 
teacher training or teacher education programs. Regardless of the emphasis, there is 
need for teacher educators to be concerned about the integration of theory and prac- 
tice as well as about the transition from student to teacher roles (Horowitz, 1984; 
Tardif, 1985; Turner, 1985). 

In the short term, immediate attention might be directed to making explicit the 
rationales and philosophies on which teacher preparation programs are based. The re- 
sults of the study indicate that prospective teachers bring different orientations and 
expectations to preservice teacher preparation programs. Their comments suggest 
that they experience programs differently and they evaluate them in different ways. 
Virtually every aspect of teacher preparation programs is viewed positively by some 
teachers and negatively by others. The sources and degrees of satisfaction vary widely 
from person to person. 

To some extent, the variation may stem from a lack of understanding of the goals 
and purposes of teacher preparation. Simply making goals and philosophies explicit 
will not solve the problem; however, explicit rationales may reduce the discrepancy 
between expectations and experiences. Prospective teachers may learn more from 
their preservice programs if they are clear on what they should and should not expect 
to learn. How these philosophies can best be communicated to prospective teachers is 
a challenge in itself. 


Conclusion 

Although many of the findings of the study will not be new to teacher educators, the 
lack of novelty should not be the basis for disregarding the views which teachers have 
about their preparation programs. Indeed, to have particular strengths identified 
again could serve as reason for accentuating or emphasizing certain desirable prac- 
tices in preservice preparation. Shortcomings on which there is a measure of consen- 
sus as well as suggestions for ways in which preparation programs might be improved 
could serve as a point of departure for considering program change. At the same time, 
there is reason to be cautious about relying too heavily on retrospective assessments 
by teachers of their experiences as a basis for reform in teacher education. Other fac- 
tors must also be considered. 

A recommendation for more research is a predictable conclusion to any such re- 
port. The need for more diversity in research on teacher education has been empha- 
sized by Wideen and Holborn (1986). To the suggestion of adopting a broadened 
perspective on studies of teacher education could be added that of initiating more 
systematic forms of action research or what Stones (1985) refers to as investigative 
pedagogy. Many of the most perplexing issues in teacher education might best be ad- 
dressed by combining research with efforts to improve practice. Action-oriented re- 
search could begin with making purposes and goals explicit, designing specific 
components of programs to achieve intentions and exploring the consequences. Per- 
haps the most productive strategy is not to attempt to develop a generalizable body of 
theory and research in teacher education but to use appropriate modes of inquiry to 
address long-standing and persistent problems in particular teacher education programs.


Notes


1. The project was initiated by the Cooperative Committee on Research in Teacher Education (CCRTE) of The Alberta 
Teachers’ Association. Financial support was provided by the Alberta Department of Education; the Faculties of Education 
at the University of Alberta, the University of Calgary, and the University of Lethbridge; and The Alberta Teachers’ 
Association. Members of the Cooperative Committee on Evaluating Teacher Education Programs (CETEP)—Rod Conklin, 
Myrna Greene and Erwin Miklos—served as the Project Team while the CCRTE performed the functions of a steering 
committee from the proposal to the final report stage of the study. 


2. Results of the study are presented in four reports which are available from The Alberta Teachers’ Association, Barnett
House, 11010-142 Street, Edmonton, T5N 2R1 at the prices indicated: Chapman (1986)—$5.00; Hawryluk (1986)—$4.50;
Hawryluk & Greene (1986)—$4.00; and Miklos, Greene, & Conklin (1987)—$7.50. The price for the set of four reports is
$16.00.

A monograph (Greene & Miklos, 1987) entitled Preparation Programs in Alberta Universities: Teachers’ Evaluation,
which summarizes the study, may also be obtained from The Alberta Teachers’ Association at $4.00 per copy. The substance
of this paper is based on the monograph.


3. Copies of the questionnaire are included in the summary report (Miklos, Greene, & Conklin, 1987) and in the monograph 
(Greene & Miklos, 1987).


PERSPECTIVES


The Pervasive, Persuasive 
Power of Perceptions 


Neil A. Johnson 


University of Alberta 


Through their influence on attitudes and behavior, perceptions are central to both 
practice and research in education. This article examines some leading developments 
in perception theory and research, particularly in respect of (a) the influence of percep- 
tions on behavior, (b) the accuracy of perceptions, (c) the capacity of individuals to ex- 
press perceptions, (d) perception processes, and (e) major factors influencing 
perceptions of persons and social events. This discussion leads to the presentation of 
seven generalizations about perception and a consideration of their implications for all 
who engage in educational endeavors. Most importantly, it serves as a basis for examin- 
ing means to limit the potentially prejudicial impact of perceptions on educational re- 
search. 


Perceptions shape human attitudes and behavior; their impact is pervasive and un- 
avoidable. They provide bases for understanding reality—objects, events, and the 
people with whom we interact—and our responses to them. Thus perceptions domi- 
nate all the situations that educational and other social researchers study. 
Perceptions also focus the work of the researchers themselves—the behaviors re- 
searchers notice and choose to record, the opinions they invite from others, and the 
theoretical perspectives within which they interpret data and frame conclusions. 

An initial intent of this article is to draw attention to extant knowledge about the 
nature of perception and the factors that affect the perceptions of individuals in edu- 
cational organizations. This provides a basis for advancing a framework of factors as- 
sociated with perception and for considering important implications of perception 
theory for educational researchers and practitioners. Researchers, in particular, need 
to be conscious of the pitfalls inherent in relying on their own and others’ perceptions. 


Mr. Johnson, a doctoral student in educational administration, is currently interested in aspects of organizational effectiveness,
job satisfaction of administrators, and school-based teacher education.



The ideas presented were drawn from the literature, from the author’s discussions 
with researchers and educators, and from personal experience. 


The Pervasive Influence of Perceptions on Educational Practice and Research 
Practitioners in educational settings deal with perceptions constantly. They are first 
employed and then evaluated substantially in accordance with perceptual impres- 
sions. Instructional practices, administrative decisions, and policy initiatives are 
guided by perceptions of individuals and educational needs, as are responses to the 
actions of coworkers and others. Knowledge of employees’ and other stakeholders’ 
perceptions helps administrators to revise educational policy and change individuals’ 
experiences in educational organizations, and knowledge of perception theory may 
provide an avenue for directly improving educational leadership and practice. 

Educational research also involves scholars in these situations. Seeking knowledge 
of the interactive behavior, personalities, attitudes, and emotions of practitioners, re- 
searchers enter university and school classrooms, the offices of administrators, and a 
variety of other locations to elicit perceptions from policy makers, administrators, in- 
structors, and students. For those investigators who desire no more than understand- 
ing of attitudes and feelings, perceptions are paramount. And, because behavior 
occurs in response to perceptions, perceptual data also hold a key to researchers’ 
knowledge of organizational behavior. Some scholars use questionnaires and 
interviews to gather perceptual data for quantitative analysis. Naturalistic research- 
ers and policy analysts also rely on perceptions stated by others, and they document 
their own perceptions of organizational events and environments. Researchers using 
interpretive orientations value perceptions as indicators of the realities for individual 
educators. Conceding that an objective reality can be known only through the filter of 
perceptions, researchers of a rationalistic bent also depend on perceptions—as 
surrogate measures of reality. Doubting the trustworthiness of their data, some schol- 
ars explore perceptions still further, using a variety of sources, strategies, and investi- 
gators to corroborate their research findings as true and full representations of life in 
educational settings. 

If educational research is to make important contributions to knowledge, re- 
searchers need to become conversant with the nature and processes of perception and 
with the factors that shape and can bias the perceptions of those engaged in research 
and practice. This will allow them to address personally and methodologically the 
trustworthiness of their scholarly endeavors. 


The Nature of Perception 

A Problem with Definition 

To date, the notion of perception has resisted clear and conclusive definition. Perception
was once regarded simply as “sensation plus meaning” (Bartley, 1980, p. 5), but most
psychologists now view sensation as a physiological experience and perception as a cognitive
activity (Krech, Crutchfield, Livson, Wilson, & Parducci, 1982). Perception is also difficult
to separate from other cognitive activities such as “inferring, categorizing or judging”
(Tajfel, 1969, p. 316).
Despite this conceptual uncertainty, there is broad consensus that perception is “the 
understanding of the world that you construct from data obtained through your 
senses” (Shaver, 1981, p. 83). As Shaver noted, such a definition implies that percep- 
tions are obtained through sensory experiences rather than merely by reflection or in- 
tuition, that an objective world exists outside the perceiver, and that the perceiver 
actively forms an impression from each stimulus.

Developments in Theory

Investigation of physiological sensory processes (e.g., Bartley, 1972; Hochberg, 1978) 
has been unable to account fully for perceptual experiences, so many psychologists 
have paid closer attention to the complementary psychological dimension. To this 
end, structuralists tried to expose and describe the characteristics of and relation- 
ships between aspects of conscious experience (Dember & Warm, 1979). Gestalt theo- 
rists criticized those attempts to decompose total perceptual structures and focused 
instead on how individuals perceive situations as a whole (Moates & Schumacher, 
1980). Allport (1955, pp. 576-612) extracted from 13 “specialized, divergent, and often 
discordant” theories of his time eight major generalizations (most of which survive in 
current theories): (a) individuals aggregate and interrelate multiple understandings 
of events; (b) these “perceptual aggregates” are organized within limiting conceptual 
boundaries; (c) perceptions are assembled over time; (d) although general order and 
stability prevail, some perceptual inconsistencies are tolerated; (e) perceptions re- 
main relatively constant over time; (f) there is a tendency to return to original “steady 
state” perceptions following the disruption of new impressions; (g) impressions are 
weighed unequally in perceptual aggregation; and (h) although perceptual aggregates 
sometimes conflict, usually they mutually support higher-order perceptual generali- 
zations. Kelly’s (1955) “role-construct theory” emphasized interrelationships among 
the individual’s perceptions, interpretations, and behaviors: By developing “con- 
structs,” or categories of events, the perceiver forms expectations and plans behav- 
ioral responses; and comparison of new perceptions with existing systems of 
constructs validates current understandings and classifications of people, objects, and 
events. Likening perceptual processes to computer system procedures, however,
“information-processing” theorists contended that perception occurs only after se- 
quential transformation stages: stimulus, sensory store, perceptual image, memory, 
and response. Moates and Schumacher (1980, p. 276) noted that, whereas Gestalt psy- 
chologists “had a remarkable flair for capturing the intuitive aspects,” information- 
processing theory helped to “formalize the sometimes vague gestalt ideas so that they 
are more testable.” The study of “social perception,” as yet still “in its infancy,” focuses 
specifically on three important avenues of research: (a) study of the effects of social 
interaction on perceptions of physical properties; (b) investigation of perceptions of 
individuals’ physical and social characteristics; and (c) exploration of perceptions of 
social relationships and events (Hochberg, 1978). The last of these, of primary interest
to educational researchers, includes the emerging fields of attribution theory and
implicit personality theory.


The Perception Process 

Perceptions seem to be formed in a series of cognitive steps. A common view is that a 
perceiver selects and categorizes sense data within predetermined structures, or 
frames of reference; in turn, these are subject to attributes of personality (Kelly, 
1980). According to Bruner’s (1951) “expectancy or hypothesis theory of perception,”
three stages are involved: the individual hypothesizes about the occurrence of a likely 
event (“hypothesis”), the environment provides an informational stimulus (“informa- 
tion”), and this prompts a confirmatory response from the individual (“confirmation”) 
(pp. 121-127). Moates and Schumacher (1980, pp. 16-17) advanced a more detailed 
explanation: sensory receptors are oriented toward a source of stimulation; certain 
features and contextual factors are extracted (noticed); and then the perceiver 
engages in “a cyclic process of orientation, feature extraction, comparison with memory,
and then additional orientation, feature extraction, and comparison,” permitting
the perception to be incrementally refined. This “chain of perception” process (Krech
et al., 1982, p. 103) accords with Litterer’s (1973, p. 108) “selection-closure-interpreta- 
tion” explanation and Forgus and Melamed’s (1976) description of a sensation-per- 
ception cycle. 

Four questions about this perception process are of central concern to educators 
and researchers: (a) To what extent do perceptions affect behavior in educational and 
other social settings? (b) Are members of educational organizations able to con- 
sciously identify and express their perceptions? (c) How accurately do perceptions 
portray reality? and (d) Are perceptions shaped by identifiable and commonly 
occurring factors? 


Perceptions and Behavior 

Through a process of “discovering what the environment is really like and adapting to 
it” (Neisser, 1976, p. 9), perceptions shape the social behavior of individuals. 
Perceptions allow individuals to understand, anticipate, and react to environmental 
circumstances, events, and the behavior of others (Forgus & Melamed, 1976; French, 
Kast, & Rosenzweig, 1985; Harvey, Weary, & Stanley, 1985; Wrightsman, 1977). Blake 
and Ramsey (1951), Litterer (1973), and Kelly (1980) also highlighted perceptions as 
the critical determinants of behavior in organizational settings and, although Tagiuri 
(1969, p. 435) warned that other factors also impose on social behavior, he concluded 
that, “if there is to be a science of interpersonal behavior, it will be based, to some ex- 
tent, on our learning more about how people come to perceive other people as they 
do.” Hochberg (1978, p. 242) also regarded the study of perception as “an important 
tool for understanding and predicting behavior” in social situations, and Shapiro and 
McPherson (1987, p. 75) recently focused attention on public policy makers’ percep- 
tions of policy dilemmas as “an important determinant” of their “policy behavior.” 

Understanding of perception is therefore critical for educational research and 
practice, for social behavior in educational settings is guided not merely by an as- 
sumed objective reality but by actors’ individual perceptions and by the factors that 
shape and distort those perceptions. 


Capacity to Express Perceptions 
Though instrumental in determining behavior, many perceptions are beyond the ca- 
pacity of individuals to consciously recognize and verbally express. Cameron and 
Whetten (1983, pp. 12-13), for example, concluded from organizational effectiveness 
research that 
There appears to be ample empirical evidence . . . to suggest that individuals frequently can- 
not report accurately the criteria of organizational effectiveness that they implicitly hold. 
Nor are they aware of the factors that motivate their judgments or evaluations of an organiza- 
tion. When researchers ask various constituency members to specify important criteria of ef- 
fectiveness, there is no assurance that the criteria they enumerate will be consistent with the 
criteria they use implicitly to judge effectiveness. 
Cameron and Whetten supported this view by citing research that asked organization 
members to rank “47 different goals in their institutions”; none received particular 
priority. Clearly, these authors’ assessment was based on an especially onerous de- 
mand on the recall and cognitive organizational capacities of respondents. 
Nevertheless, it does signal a potential danger in seeking perceptual measures of
individuals’ impressions about abstract organizational attributes.


How Accurate are Perceptions?

On the matter of perceptual accuracy, Rock’s (1975, p. 5) research findings are en- 
couraging: “granting that the perceived world is different from the world that is the 
object of perception, one can still say that there is a high degree of correspondence.” If 
so, we may treat perceptions as broadly accurate indications of the “real worlds” of 
educational organizations. On the other hand, the literature also includes numerous 
references to “illusions” attesting to a clear discrepancy between reality and percep- 
tions of objects. Of course, the accuracy of perception of social events is more difficult 
to measure, for phenomena that defy direct inspection and measurement can be 
known and understood only through a process of perceiving; if individuals’ percep- 
tions of an event agree, we can only assume that they reflect reality (Rock, 1975; 
Shaver, 1975). Even so, Gladstein’s (1984) research and a range of studies reviewed by 
Wrightsman (1977) disclosed a pervasive human incapacity for perceiving and recall- 
ing either objective or social events with accuracy—a finding that supports Allport’s 
(1961) generalization that “good judges” are rare and Blake, Ramsey, and Moran’s 
(1951, p. 5) conclusion that “sometimes... it is not difficult to show that the margin of 
interpretative error is very wide.” Clement (1978) blamed selective perception for this 
apparent disparity—selections that, of course, differ among individuals. Availability 
of information is an associated problem: possessing only fragmentary information, in- 
dividuals are frequently obliged to make perceptual assumptions and hold personal 
expectations which lead them to perceptions that diverge from reality and from those 
of other witnesses (Hochberg, 1978; Litterer, 1973). Hochberg (1978) further identi- 
fied three common causes of perception-reality discrepancy: (a) events that cannot be 
discerned; (b) omissions, additions, and distortions arising out of human perceptual 
processes; and (c) events whose significance is misunderstood. Differing histories of 
perceptual learning, attention, and intentions also lead individuals to form different 
impressions of events and persons. 

As much of the research and our personal experience tell us, individuals’ percep- 
tions of social events frequently differ. For example, we may expect an instructor, an 
inquiry-oriented student, and a student motivated only by threat of examination to 
perceive the purpose, product, and quality of a shared seminar experience quite dif- 
ferently. Hence educational practitioners and researchers would do well to draw on a 
variety of perspectives and to recognize the limitations of their own and others’ per- 
ceptions when forming impressions and making decisions. For many researchers, in 
particular, reliance on multiple data sources may increase the convergence of percep- 
tions and thus the likelihood of discovering a shared or objective reality. 

Despite inaccuracy, there is often considerable consistency in social perceptions as 
well as in the biases that cause individuals’ perceptions to depart from reality and the 
impressions of others. Perception research has disclosed a range of central influ- 
ences—summarized in Figure 1 and discussed below—that can help us to understand 
and account for the effects of perceptions on educational research and practice. 


Factors that Affect Perceptions 
Perceivers arrive at their limited, differing, sometimes distorted impressions of per- 
sons, objects, and events only after influence from a range of complex and subtle fac- 
tors. Despite an initially bewildering array of influential factors, certain “perceptual 
determinants” emerge as common among perceivers (Blake et al., 1951, p. 8). Scholars 
have classified them in a variety of ways, such as familiarity, strength, and salience of 
stimuli (Kelly, 1980) and “experience, intelligence, and empathic ability” of perceivers 
(Shaver, 1975, p. 22). Most comprehensive are Forgus and Melamed’s (1976) four
categories: (a) influences of social experience and cultural background; (b) impact of
the perceiver’s values, attitudes, and personality; (c) dynamics of person perception;
and (d) perceptions of causality in social events.

Figure 1: The Perception Process and Influential Factors


Social and Cultural Experience 

Social/organizational experience and cultural heritage are widely accepted as impor- 
tant determinants of perceptions. Clement (1978, p. 51), for example, argued that in- 
dividuals process and categorize environmental occurrences according to existing 
“perceptual structures”—structures based on prior experience that are expanded and
altered by the new perceptions. To Wrightsman (1977, p. 95), these perceptions de- 
velop as if “peeling an onion.” Though the weighting of new and existing pieces of in- 
formation is unclear, first impressions and recent and frequent events seem to impose 
heavily on subsequent impressions—except where perceivers consciously resist their 
prejudicial influence (Epstein & Rock, 1960; Krech et al., 1982; Litterer, 1973). Com- 
mon work and other experiences also lead individuals to form similar perceptions 
(Litterer, 1973) and there is evidence that social contact and familiarity with different 
classes of persons and situations, group norms, personal self-confidence, and linguis- 
tic classificatory labels also affect perceptions (Tajfel, 1969). 

Furthermore, perceptions of facial and other perceptual cues seem to be culturally 
influenced; individuals from different cultures sometimes express emotion differ- 
ently, so their cues are open to misinterpretation (Brunswik, 1944; Tagiuri, 1969). Of 
course, the discipline of anthropology also has an extensive literature attesting to the 
pervasive and powerful effect of culture on individual perceptions and on resultant 
“appropriate” behavior. According to Murphy (1979, pp. 14, 22-23), the “majority view 
among social anthropologists” favors substantial cultural determination of individual 
perceptions and actions. By shaping perceptions, culture “tells us how we should act 
... [and] what we can expect of the other person. Behavior is thus rendered predict- 
able, often within broad limits, and we gain a degree of mastery and confidence in so- 
cial situations.” Likewise, Vivelo (1978, pp. 16-17) saw culture as a “system of rules or 
a pattern for behavior—a conceptual pattern that individuals use to explain and jus- 
tify their experiences or events.” Again, Geertz (1979, pp. 43, 46-47) proposed culture 
as “a set of control mechanisms—plans, recipes, rules, instructions” that impose on in- 
dividual perspectives and behavior. At the same time, Geertz focused attention on 
culture’s diverse effects on individual perceptions and behavior: “Becoming human is 
becoming individual, and we become individual under the guidance of cultural pat- 
terns.” 


Organizational Factors 

Litterer (1973) highlighted certain characteristics of organizational life that affect 
members’ perceptions. Particularly important is organizational socialization: col- 
leagues, superordinates, and subordinates instruct perceivers about physical, finan- 
cial, personal, and other signals, thereby educating them about prevailing views of 
reality and encouraging them to behave accordingly. Individuals—particularly those 
who lack self-confidence or who value social acceptance—are persuaded to accept, or 
at least appear to accept, the “group consensus,” especially in ambiguous social set- 
tings. Perceptions are further affected by the nature of organizational interaction, by 
authoritative or participative leadership, by expected and actual role behavior and 
statuses, and by rewards for creativity and achievement; and work stress can prompt 
unduly early and inaccurate impression formation. The “reference groups” individ- 
uals use to judge norms of acceptable behavior and to provide bases for comparison 
also influence perceptions and behavioral responses. As French et al. (1985) added,
groups frequently distort perceptions about opposing groups with competing inter- 
ests, leading to mutual mistrust, suspicion, power-consciousness, competitive behav- 
ior, and misunderstanding of others’ intentions. 


Personality Characteristics 

Many personality variables have emerged as important influences on individuals’ so- 
cial perceptions; beliefs, needs, emotions, attitudes, values, and expectations of 
perceivers are frequently cited (French et al., 1985; Kelly, 1980; Krech et al., 1982). 
Central, however, is Tagiuri’s (1969) classification of the personality-related 
“judgmental errors” (Krech et al., 1982, p. 705) identified by Bruner (1951) and other 
researchers during the 1950s and 1960s: (a) “halo effect,” or a tendency to form global 
impressions based on overall judgments of goodness or badness (such as where a 
teacher obtains an initially unfavorable impression of a colleague or student and at- 
tributes other undesirable characteristics to that person); (b) “logical error,” or as- 
sumed coexistence of particular traits (where a pupil assumes, say, that a teacher’s 
energy will be accompanied by aggressiveness); (c) “leniency effect,” or a tendency to 
rate persons high on favorable attributes and low on unfavorable ones; and 
(d) “assumed similarity,” where other persons are assumed to have emotions and 
traits akin to those of the perceiver. Modern writers on perception commonly add a 
fifth factor: “stereotyping,” that is, “a tendency to assign to people many of the charac- 
teristics (real and imagined) of the typical member of the group or class to which they 
are thought to belong” (Krech et al., 1982, p. 705). For this reason, a professor might 
be classified as absent-minded or a school principal regarded as an expert teacher 
merely on account of their status. 

Other research has explored the intervening nature of personality through four re- 
lated factors: “perceptual constancy,” “perceptual defence,” “recognition threshold,” 
and “preperceptual selection.” Individuals’ stable and consistent perceptual 
frameworks are shaped and reinforced through heightened awareness and enhanced 
valuation of prominent and personally relevant events (Bartley, 1980; Kelly, 1974). 
To help preserve perceptual “balance,” perceivers also “intellectualize” about discord- 
ant events, shaping them to fit existing understandings (Heider, 1980, p. 10; Litterer, 
1973). Indeed, perceptual defences may cause perceivers not merely to amend inter- 
pretations of items but to defensively repress contradictory information from con- 
scious recognition. Paradoxically, selection necessitates prior recognition 
(preperception); as Hochberg (1978, p. 216) explained, “the self we are aware of... 
sees only what the unconscious observer permits him to see.” Less absolute than per- 
ceptual defences are recognition “thresholds.” Perceivers readily identify important 
and consistent (low threshold) stimuli; incongruent items can also be recognized, but 
only after blatant or extended exposure.

Additional personality factors intrude in the formation of perceptions. “Field-in- 
dependent” persons retain their preconceived mental sets even in the face of new and 
conflicting evidence: new events all too easily convince “field-dependent” individuals 
to amend their perceptions (Hamachek, 1970). High intelligence (Kelly, 1974), “cogni- 
tive complexity” and maturity (Bieri et al., 1966; Wrightsman, 1977), inferential abil- 
ity and knowledge of persons or situations (Gage & Cronbach, 1955), emotional 
adjustment, interests, and fields of specialization and social interaction (Taft, 1955) 
also affect judgments of people and social events, as do the personalities of stimulus 
persons, the nature of the judgmental tasks involved (Tagiuri, 1969) and the 
judgments and behaviors of relevant social groups (Asch, 1960; Deutsch &
Gerard, 1960).


Perceptions of Persons

Perceptions of persons and their relationships and behavior are more complex—and 
often less accurate—than those of inanimate objects and other social events; for per- 
son perceptions embody impressions of “intentions, attitudes, emotions, ideas, abili- 
ties, purposes, traits, thoughts, perceptions, memories—events that are inside the 
person and strictly psychological” (Tagiuri, 1969, p. 396). Three aspects of “person 
perception” are important: (a) perceptions of observable behavior and attributes; 
(b) assignment of attributions to explain behavior; and (c) use of “implicit personality 
theories” to form impressions. 


Perceptions of Observed Behavior and Attributes 

Individuals utilize a range of cues and information sources to understand others’ feel- 
ings, intentions, and relationships; extrinsic attributes of perceived individuals and 
the personality types, traits, prejudices, and stereotypes of perceivers themselves are 
powerful forces (Hochberg, 1978). Facial and physical characteristics of perceived 
persons provide particularly strong and consistent—but inaccurate—signals about 
personalities and emotions (Tagiuri, 1969); facial expressions seem to be interpreted 
similarly even across cultures (Ekman, Sorenson, & Friesen, 1969; Izard, 1971). 
Posture and position provide cues about both positive and negative attitudes, and 
averted gaze and abnormal mannerisms such as hesitation and silence create negative 
impressions of persons (Mehrabian, 1972). The implications of these findings for per- 
ceptions of self-conscious pupils, socially inept administrators, and unconventionally 
attired instructors are clear. Tagiuri (1969) noted that perceptions of persons are fur- 
ther complicated by the influences of the halo effect, logical error, and other factors 
referred to earlier, by role relationships, status differences, and other contextual vari- 
ables, and by the differing perceptual skills needed to gain impressions of individuals, 
groups, and social events. Through “dyadic interplay,” perceivers’ own actions also af- 
fect the behavior, and thus perceptions, of those they seek to appraise: 

The perceiver ... has to allow for the fact that he himself ... is also the object of perception
and thought and that, as such, he influences his own object of perception. Observer and observed
are simultaneously observed and observer. Their reciprocal feedback processes modify
their self-presentation and, in turn, their reciprocal perceptions, in a continuous recycling
but varying process during which each person uses the variations in himself and the other
person as a means of validating his hypotheses about the other. (Tagiuri, 1969, p. 426)

This quotation reminds us also of the array of “interaction ritual” and “impression 
management” strategies that we employ daily to influence the perceptions of others in 
the university, school, or office (Goffman, 1967; Schlenker, 1980). 

To deal with all of this complexity, perceivers stereotype groups of persons and so- 
cial settings; perceptions of faculty members, say, tend to be affected by impressions 
of outstanding professors—although observers’ confidence in those judgments dimin- 
ishes with the discovery of intragroup variation. More generally, four “experiential 
principles” seem to guide most perceptions of behavior and attributes: (a) initial 
person perceptions are “gut-level,” generally positive or negative feelings that usually 
are subsequently confirmed; (b) perceivers initially notice and seek explanations for 
unusual attributes or behaviors; (c) first impressions focus on observed characteristics 
and behaviors, although perceivers quickly progress to perceptions based on personality
traits; and (d) individuals view other persons in terms of generally “unified, organized
collection[s] of traits that usually ‘hang together’” (Krech et al., 1982,
pp. 699-700).

Rist’s (1970) classic study of teachers’ expectations and student achievement
graphically illustrates the profound educational impact of perceptual influences such 
as the halo effect and first impressions. Rist observed that a kindergarten teacher’s 
classification of pupils as “fast learners” and “slow learners” was based on five initial 
“subjectively interpreted attributes and characteristics” of students—physical ap- 
pearance, body odor, skin color, forthrightness of behavior, and use of language in 
classroom interaction. Having classified the pupils according to her social class-based 
“ideal type” of the successful student, the teacher (and soon also the students) treated 
the two groups differentially—encouraging one and controlling the other—and their 
performances diverged accordingly. At succeeding grade levels, use of these perform- 
ance measures helped to reinforce the self-fulfilling prophecy that had arisen out of 
those initial perceptions. 


Attributing Cause to Explain Behavior 

According to proponents of “attribution theory,” perceivers not only observe and de- 
scribe persons and their behaviors, but they often seek greater understanding by at- 
tempting to determine causes for behaviors (Shaver, 1975). Attribution theory has 
attracted attention among social psychologists since the 1970s (Hewstone, 1983): edu- 
cational researchers continue to ignore this avenue for investigating perceptions and 
behavior among educational policy makers, administrators, educators, and students 
(Frasher & Frasher, 1981). “An attribution is an inference about why an event 
occurred or about a person’s dispositions” (Harvey & Weary, 1981, p. 6). 
Wrightsman’s (1977, p. 100) illustration is instructive: 

People perceive the behavior of others as being caused and ... they attribute the cause either
to the person, to the environment, or to a combination of the two. For example, Joe failed to
call me last night. I conclude that either he forgot (called a personal or dispositional attribution)
or his car stalled and he couldn’t get to a phone (environmental or situational attribution)
or he was mad at me but when he got over being mad the phone at his house was tied up
(combination of dispositional and situational attributions).

Apparent accidents, involuntary actions, and unintended outcomes usually require no 
attributions of cause, and adherence to social norms and conventions also explains 
many recurring behaviors. Unanticipated and seemingly intentional behaviors, how- 
ever, call for attribution of a personal trait or external cause; the observer thereby ac- 
quires an understanding of the actor and/or situation—one that may create a lasting 
impression about the other individual’s disposition and/or about their social circum- 
stances (Krech et al., 1982; Shaver, 1975, 1981). 

Heider (1985) identified five main strategies for determining cause: (a) “Trying” — 
behavior that reflects exertion and a clear intention is usually regarded as personally 
caused (e.g., an educator who displays considerable expenditure of energy on a worthwhile
project is likely to be judged responsible for its success); (b) “Ability”—actors
are held responsible if they seem to have the requisite intellectual, physical, or social 
ability to produce the outcomes observed (e.g., a seemingly talented professor or 
teacher might be regarded as the cause of high grades in a class); (c) “Unusual” or “low 
consensus” behavior—surprising occurrences tend to be blamed on the actors in- 
volved; (d) “Distinctiveness”—the more frequently a person acts in a recognized way, 
the more likely the behavior is to be viewed as a personal characteristic (e.g., if an ad- 
ministrator repeatedly praises all staff members for their achievements, observers 
may view the commendation as indicative of the administrator’s flattering or 
manipulatory disposition: if the administrator praises staff selectively, then an 
external factor—the subordinate(s) singled out for approval—is likely to be held responsible);
and (e) “Task difficulty”—a highly complex task, or one that is resisted by
the social environment, is more likely to be attributed to the actor (Krech et al., 1982; 
Shaver, 1981). In addition, a perceiver may consider not only the reasons for an ob- 
served behavior but alternative behaviors that the actor elected to forego (Jones & 
Davis, 1965). Behavior that seems unconventional gives the perceiver “high corre- 
spondence”—confidence of knowing what unusual personality characteristic caused 
it. For example, a principal who sharply criticizes district policies at a meeting of 
school trustees may be confidently presumed to possess strong personal attitudes 
about those policies. Finally, the perceiver may search for the source of cause-effect 
“covariation” (Kelley, 1967)—“the causal candidate which is most closely associated 
historically with the event being explained” (Antaki, 1982, p. 11). Three personal and 
environmental elements are taken into account: (a) the observer infers personal at- 
tributions from “distinctive,” or uncharacteristic, actions; (b) an explanation is de- 
rived by examining the degree of “consistency” or variation in behavior across 
differing circumstances; and (c) “consensual” behavior by other actors indicates to the 
observer that environmental factors caused the behavior. 

Other factors intervene in the attribution process. Socially undesirable acts 
prompt strong personal attributions, and unfavorable information about behavior 
amplifies observers’ interest in identifying causes (Wrightsman, 1977). “Personal mo- 
tives” make certain kinds of observers especially subject to attributional bias: those 
who believe “people get what they deserve” consistently blame others for their 
misfortunes; perceivers who anticipate similar fates usually blame circumstances 
(Shaver, 1981, p. 138). Trained professionals arrive at distinctive attributions for 
events within their fields of expertise, probably owing to their exclusive knowledge 
(Frasher & Frasher, 1981). There are also interactive effects among these factors. If 
parents or administrators expect high-performing or high-ability teachers to succeed, 
subsequent high attainment is likely to reinforce opinions about the teachers’ high 
ability whereas failure may be attributed to their lack of effort. 

Our attributions often result in erroneous perceptions. Actors are conscious of en- 
vironmental constraints and circumstances involved in making behavioral choices: 
observers, however, usually have to speculate about the actors’ “circumstances, his- 
tory, motives, and experiences,” so they focus on and over-attribute behaviors to per- 
sonal factors (Jones & Nisbett, 1971, p. 5)—“the fundamental attribution error” 
(Ross, 1977, p. 183). 

We also engage in self-attribution to form impressions about ourselves. Knowing
our own contextual factors, we tend to “blame the environment” (e.g., unsupportive 
politicians) for our own behavior: observers blame our personal characteristics (e.g., 
political naiveté; Jones & Nisbett, 1971, p. 2). Moreover, feeling confident that external
circumstances are responsible, we rarely seek confirmation of self-attributions
(Olson & Ross, 1985). Further self-attribution bias arises out of our explanations of 
academic, vocational, and social successes and failures: We tend to blame external cir- 
cumstances for our own failures but see ourselves as instrumental in successes 
(Shaver, 1981; Wrightsman, 1977)—an “ego-protective” distortion of attributions 
(Hewstone, 1983, p. 3). In this regard, Rogers (1982, p. 231) reported an observed 
tendency for classroom teachers to describe pupils as “odd,” “disturbed,” or “peculiar” 
when explaining their students’ failures. Self-attributions also affect our behavior: attributions
to “stable,” or long-lasting, qualities such as ability or hard work motivate
future effort; attributions to transient personal factors, such as luck, mood, or fatigue,
do not (Weiner, 1979).

In summary, Krech et al. (1982, p. 715) concluded from their review of research
that, where perceivers witness pleasurable or undesirable behavior, are affected by be- 
havior, place great importance on behavior, or observe their own behavior, they “are 
likely to both see and believe what [they] ‘want’ to, and . . . attribute accordingly.” 
Furthermore, the greater the involvement of perceivers in behavior, the more likely 
perception bias—and conflict of perceptions—is to arise and the stronger is its prob- 
able effect. Despite current criticism of attribution theorists’ reliance on rationality, 
predictable bias, and human desire to investigate and process social interaction infor- 
mation (Hewstone, 1983; Lalljee & Abelson, 1983), attributional principles have sub- 
stantial relevance for educational settings: They signal a need for open 
communication within faculties, staffrooms, and classrooms and help in understand- 
ing educators’ and stakeholders’ explanations for organizational and personal 
successes, failures, and behavior. And as Frasher and Frasher’s (1981) novel frame- 
work indicates, attribution theory also holds important insights for educational re- 
searchers. 


Implicit Personality Theories 

Although observations and attributions help in the development of perceptions, the 
perceiver finally relies on a “set of expectations about which personality traits will be 
mutually associated” to form organized impressions about others (Shaver, 1981, 
p. 189). Two principles of perception are involved: (a) despite the incompleteness of 
available and observed data, the perceiver develops consistent perceptions (“forms 
impressions”) of other persons by identifying and stressing certain perceived attrib- 
utes and inventing others to complete the picture; and (b) the major determinant of 
impressions of persons is not external stimuli but each perceiver’s personal, unstated 
(“implicit”) theory about the traits associated with different personality types (Krech 
et al., 1982). The perceiver categorizes other persons primarily according to physical 
characteristics, temperament, intelligence, and socioeconomic standing; other attrib- 
utes that are assumed to follow are conferred accordingly. The perceiver also classifies 
each attribute on the dimensions of goodness-badness, strength-weakness, and 
activity-passivity (Osgood, Suci, & Tannenbaum, 1957). Consequently, on the basis of 
observed behavior, the perceiver can both form a unified impression about the actor 
and rate that person on each of the three dimensions. 

In a study of sales teams in the communications industry, Gladstein (1984) found 
implicit theories of leadership traits to be important in the formation of impressions 
among team members. Furthermore, Gladstein’s “most striking finding” was that 
these theories had led individuals to erroneous perceptions of coworkers’ personalities 
and, hence, to unfounded explanations of their successes or failures. Similarly, in an 
educational context, a junior postsecondary instructor might confer on a colleague of 
high status the traits of coldness and insincerity. The instructor may believe that these
traits are not only consistent with high status but indicative of, say, relative badness, 
extreme strength of personality, and high activity. Despite the instructor’s scant 
knowledge of that colleague, an impression is formed—one that is likely to be not only 
partially unfounded but subject to observational error based on first impressions, at- 
tribution biases, and so on. In this way, implicit personality theory highlights the im- 
portant role of educators’ predetermined beliefs about personality in the formation of 
impressions of persons, as well as the constructed unity of those impressions.


Implications for Education

Research on perception highlights the powerful, unavoidable impact of personal perspectives
on all activities—including educational practice and scholarly endeavor.
The following seven generalizations highlight for educators, administrators, research- 
ers, and readers of research reports central aspects of that impact. 


1. The perception process comprises a number of stages: stimulus, sensation, com- 
parison with an implicit theory of personality and existing perceptual knowledge, 
attribution of cause for personal behavior, and impression formation/amendment. 

2. Perceptions, rather than objective reality, at least partially shape individual atti- 
tudes and behavior. 

3. Perceptions are frequently no more than approximations of reality. 

4. Individuals are incapable of recognizing and verbalizing some perceptions. 


5. Perceptions tend to develop as internally consistent and temporally stable struc- 
tures. 


6. Perceptions are affected substantially by an extensive but recurring array of fac- 
tors arising out of social and cultural experiences, organizational attributes, and 
personality characteristics. 


7. Additional factors, such as first and recent impressions, impose on perceptions of 
persons. 


Implications for Practitioners

Such knowledge of perception may be seen by some policy makers, administrators, or 
other practitioners as “a tool to predict and control human behavior” (Hochberg, 1978, 
p. 213). For example, by manipulating organizational structures, social relationships, 
and events, and by displaying appropriate personality characteristics and communi- 
cating suitable information, administrators may attempt to shape the perceptions 
and resultant behavior of subordinates, sponsors, and other stakeholders. Litterer 
(1973) and Bosetti (1973) defended this use of “administrative apparatus” to integrate 
discrepant perceptions and efforts in schools and other organizations. Bosetti also saw 
a place for administrative intervention to relieve dysfunctional stress and its 
symptom, rapid perceptual closure. 

Alternatively, educators and others may utilize their knowledge of perception the- 
ory more personally to improve their perceptual skills and interactive behavior in 
order to approach their daily tasks and social interaction more fully and more accu- 
rately informed. Initially, they need to be aware of the factors that distort impressions 
of people and bias judgments of events. For example, perception theory informs us 
that the quality of educational evaluations and decisions about personnel and other 
matters may be enhanced if perceivers resist perceptual influences such as the halo ef- 
fect, logical error, and assumed similarity. By admitting and weighing evidence that 
contradicts existing images of persons or events, overall perceptions can be tested, re- 
vised, and refined. The principles of implicit personality theory and attribution the- 
ory also highlight the prevalence and sources of differing and inaccurate perceptions; 
they remind practitioners of the importance of frank, regular, organization-wide com- 
munication and of the need to watch for bias in their own and others’ causal explana- 
tions of organizational performance, individual action, and conflict. Particular 
attention should be paid to the reasons they use to justify their own behavior, atti- 
tudes, plans, achievements, and failures, and they need to bear in mind their potential 
as observers for erroneously holding others fully accountable for their behavior. The 
foregoing generalizations and discussion highlight other important factors for which 
professors, teachers, and administrators need to be alert.

Beyond awareness, practitioners can act directly to improve the quality of their 
perceptions—and, therefore, the appropriateness of their behavior—in educational 
contexts. First, resisting undue influence from initial and recent impressions, educa- 
tors can consciously and systematically consider issues, persons, and events from a 
variety of standpoints. In this regard, Morgan’s (1986) advice for sound organizational 
management can inform educational personnel. Morgan argued that effective profes- 
sionals possess a capacity to “read social situations,” whereby they intentionally 

remain open and flexible, suspending immediate judgments whenever possible, until a more
comprehensive view of the situation emerges. They are aware of the fact that new insights
often arise as one reads a situation from “new angles,” and that a wide and varied reading can
create a wide and varied range of action possibilities. (p. 12)

Indeed, Morgan’s elaboration of disparate metaphorical images for viewing organiza- 
tions affords a framework of multiple, divergent perspectives that administrators and 
others may find helpful in their effort to view events and persons in diverse ways. 

Second, educators need to test their perceptions against those of colleagues, stu- 
dents, administrators, and stakeholders with whom they associate; this calls for a con- 
certed strategy of frank, nonjudgmental communication in the school, university, or 
boardroom setting, with regular invitations to others to express support or present 
contrary perspectives. As an example, Kurmey (1986, pp. 3-4) commented that 
school-level evaluators who “actively solicit the perceptions of those whom they are 
evaluating” can improve understanding, communication, and the quality of evalua- 
tions, and can relieve teachers’ work stress and feelings of anxiety about supervision. 

Third, practitioners need to make critical thinking a feature of their professional 
work lives. Even more than other professionals, educators are susceptible to the con- 
fining socialization of schooling, teacher education programs, and practicum, intern- 
ship, and initial teaching experiences (Lanier & Little, 1986; Patterson & Miklos, in 
press; Tardif, 1985; Warren, 1985) that imprisons fledgling professionals within con- 
ventional views of education and teaching and discourages analysis and critical reflec- 
tion. Strategies for fostering critical reflection in education faculties and schools have 
been proposed elsewhere (Johnson, 1987). It is sufficient to note here that academics, 
policy makers, teachers, and administrators must bear individual responsibilities for 
exposing, discussing with colleagues, and habitually questioning existing educational 
values and theories and the instructional practices to which they subscribe. 


Implications for Researchers 

Perceptions also impose on educational inquiry in two critical ways. First, researchers 
draw heavily on the perceptions of individuals associated with educational organiza- 
tions to learn about life in those contexts, searching either for proxy measures of an 
objective reality or for expressions of disparate realities for individuals. In either re- 
spect, perceptions have much to reveal about meaning and behavior in educational 
situations. At the same time, the perceptions gained from selected informants afford 
only partial views of the reality perceived by participants—perceptions prejudiced by 
all the prior social and cultural experiences, organizational factors, personality char- 
acteristics, and other factors that bias and otherwise shape individuals’ expressed at- 
titudes and behaviors in diverse, immeasurable ways. 

Second—less obvious, but equally important—is the impact of perceptions on re- 
searchers themselves. Researchers’ own perceptions shape their theories about life in 
educational organizations, prompt them to identify and regard certain kinds of activ- 
ities and ideas as worthy of study, bias their selections among available methodologi- 
cal techniques, and influence the data they collect from educators and other 
stakeholders. In the first place, paradigmatic stances direct and limit researchers’ foci
of inquiry; writers such as Burrell and Morgan (1979), Owens (1982), and (in the cur- 
rent volume of The Alberta Journal of Educational Research) Jacknicke and Rowell 
(1987) have made this abundantly clear. One researcher, for example, may approach 
the study of a school or higher education setting from the standpoint of a traditional 
“rationalistic paradigm” or “empirical-analytic inquiry orientation.” As Jacknicke and 
Rowell (1987) explained, “The researcher within this orientation sees a classroom existing
as a set of variables, and . . . attempts to explain and predict behaviors in terms
of these variables” (p. 65). A second researcher, adhering to an “interpretive” or “natu- 
ralistic” orientation, may investigate the same social situation expecting to find so- 
cially constructed, multiple realities. This researcher may accept what Owens (1982) 
labeled the “naturalistic-ecological hypothesis which claims that human behavior is so 
significantly influenced by the context in which it occurs that regularities in those 
contexts are often more powerful in shaping behavior than differences among the individuals
present” (p. 5); or the researcher may emphasize individuality, believing
that “What is true for one person is not for another. In that sense we live in different 
worlds” (Greenfield, 1982, p. 5). In either case, the second researcher will see realities 
as multiple and context-bound and will be concerned, as Jacknicke and Rowell (1987) 
observed, to “make explicit the expectations, attitudes, motives, and common 
meanings that teachers and students have in specific situations” (p. 67). A third re- 
searcher, subscribing to a “psychic prison” (Morgan, 1986), “critical theory” (Bates, 
1986), or other “radical humanist” (Burrell & Morgan, 1979) orientation, may be led to 
explore the social metaphors and assumptions that individuals use to dominate 
others, express their interests, and otherwise cope in the social setting of concern. 
Jacknicke and Rowell (1987) noted that, with this guiding orientation, “concerns re- 
lating to epistemology, power relationships, root metaphors, politics of schooling, and 
other factors that interplay to construct that phenomenon we know as education need 
to be examined and exposed” (p. 68). Clearly, the three researchers’ orientations will 
direct them toward different issues and divergent explanations of circumstances and 
events. More than this, through their emphases on and enhanced responsiveness to 
specific dimensions and views of organizational life, their limited foci are attended by 
a potential for misconstruing actions and expressions of opinion accordingly. 
Morgan’s (1986, p. 197) discussion of the political metaphor stressed this potential for 
a chosen orientation to bias the perceptions of an observer: “Under the influence of a 
political mode of understanding everything becomes political. The analysis of inter- 
ests, conflicts, and power easily gives rise to a Machiavellian interpretation that 
suggests that everyone is trying to outwit and outmaneuver everyone else.” Again, a 
radical humanist perspective might lead a researcher to perceive erroneously that an 
individual’s action or statement is fundamentally an expression of domination; con- 
versely, to ignore this perspective may also be to overlook an important explanation 
for that behavior or expression of opinion. 

Also critical for the quality of research is the inescapable influence of every 
researcher’s perceptions on the processes of data collection, analysis and interpreta- 
tion, and subsequent reporting of results. Initially, research methodologies are fre- 
quently dictated not only by extant knowledge and needs for further study in 
particular areas, but also by the preferences, interests, and self-perceived capacities of 
individual researchers. For those who pursue quantitative studies, data collection is 
subject to researchers’ prior judgments about aspects of phenomena that are worthy 
of attention. These judgments influence the design and wording of questionnaire 
items and the construction of interview schedules and management of interviews.
Thereby, researchers’ perceptions limit the range of subsequent responses. 
Perceptual biases also impose on analyses of those data, including selections among 
available statistical and other techniques, and on the arrangement and reporting of 
findings. For policy analysts, perceptions direct the selection of respondents, identifi-
cation of focal issues, choices among available documentary sources as well as the rela-
tive weightings and meanings assigned to various stakeholders’ expressions of
opinion. Other researchers, trying to minimize the confining influences of existing 
theoretical knowledge and personal belief, prefer to produce detailed descriptions of 
events, statements, and other observations in specific contexts and to allow these raw 
data to direct the development of theoretical frameworks for understanding life in 
those contexts. Even so, few would deny that the favored research perspectives of 
these central research “instruments” (the researchers), their selective perceptions and 
explorations of events, and their heightened awareness of and identification with par- 
ticular informants and viewpoints all vitally affect data collection as well as subse- 
quent efforts to ascribe meaning and significance and to extract themes from 
voluminous, often conflicting, data. 

What can researchers do to overcome—or at least minimize—the threat of preju- 
dice arising from reliance on their own and others’ perceptions? Initially, it should be 
admitted that perceptions, as powerful, pervasive guides to social behavior, are 
valuable objects of scholarly attention. Moreover, the first issue—informants’ percep- 
tions—is of less concern to researchers who subscribe to a strictly individual concep- 
tion of reality for every respondent and informant; diverse perceptions are seen as 
indicative of diverse realities. Even so, these researchers must have regard for the 
faithfulness of the perceptions obtained. The concern remains central, however, for 
the majority of educational researchers, each of whom searches either for an objective 
reality or for a realistic account of a shared reality for inhabitants of a chosen educa- 
tional setting. The trustworthiness of perceptions acquired by researchers can be 
actively addressed in two ways. First, researchers are better prepared to deal with per- 
ceptual bias and associated difficulties if they are conversant with and watchful for 
the factors (discussed above) that impose heavily on informants’ perceptions and, 
thus, upon the quality of data acquired. Second, naturalistic researchers in education 
and scholars in other disciplines have devised useful strategies for enhancing and es- 
tablishing the trustworthiness of perceptual data; these are worthy of consideration 
by all who engage in research with informants and respondents. (These strategies are 
noted in subsequent discussion.) 

The constraint caused by factors influencing researchers’ own perceptions is more 
subtle and, hence, more difficult to manage, yet it remains a critical constraint on the 
quality of all educational research. This potential for perceptions to shape—or, more 
disturbing, to misshape—outcomes of educational research is clear. Here again re- 
searchers bear dual responsibilities—the first involving perceptual self-awareness. 
Researchers need to acquaint themselves with the factors that influence perceptions 
and should commit themselves to introspection and frank appraisal and disclosure of 
their own research orientations, methodological biases, and inherent prejudices relat- 
ing to the persons, organizations, and issues associated with each research effort. Only 
then may we expect researchers to be equipped to investigate and report accurately on 
influences that focus and restrict the outcomes they present. These proposals warrant 
brief elaboration. 

First, researchers need to consider the powerful and deleterious influences of first 
impressions and personal preconceptions and assumptions. As with practitioners, 
they need to avoid hasty assessments and instead should view social situations, indi-
viduals, and their own research efforts expansively. Second, researchers can enhance
the quality of their work by expending effort to reflect critically on, acknowledge, and, 
as far as possible, accommodate opposing conceptions of reality in their studies. 
Ideally, the confining and potentially misleading influences of particular research 
orientations may best be avoided by accommodating multiple perspectives within in- 
dividual research initiatives. Of course, such effort to furnish readers with a range of 
viewpoints leads to divergent, at times competing, interpretations, explanations, and 
conclusions—conclusions that, as Jacknicke and Rowell (1987) observed, may be dif- 
ficult to reconcile. Moreover, this approach is frequently infeasible, given time and re- 
source constraints. At the very least, then, scholars have an obligation to reflect on 
and admit the limitations that chosen research orientations impose on their investiga- 
tions and resultant presentations of findings. Readers of their research reports can 
thereby be afforded some knowledge of the nature and extent of, and biases inherent 
in, those contributions to knowledge. Third, each researcher should endeavor to select 
and employ methodological techniques most suited to the research problems identi- 
fied. The temptation to accede to personal preference must be strenuously resisted, 
and so the need for dispassionate review of choices is vital. Fourth, each researcher 
should consciously and as “objectively” as possible (having regard for the perceptual 
influences noted earlier) subject to personal review all personal beliefs, prior concep- 
tions, and other known biases that relate to the educational issues under investiga- 
tion. Subsequently, wherever possible, these matters should be exposed to public 
scrutiny in final research reports. As Dimen-Schein (1977, pp. 13-14) entreated an- 
thropologists: “We always begin [research] with preconceptions. ... Such preconcep- 
tions organize our perceptions. ... We must therefore make them explicit, and we 
must state and explain the reasons for them.” Only in this way can readers judge the 
trustworthiness of researchers’ investigative decisions and the scholarship of their 
contributions. 

Strategies also exist to deal directly with the potentially prejudicial power of per- 
ceptions—of informants and researchers—on the trustworthiness of research 
outcomes. They have been explicated in detail in the literature and need only be men- 
tioned here. Nevertheless, the centrality of perceptions and likelihood of perceptual 
nonrepresentativeness and bias in educational endeavor highlight their relevance for 
all research initiatives. Educational writers (e.g., Bogdan & Biklen, 1982; LeCompte & 
Goetz, 1982; Lincoln & Guba, 1985; Merriam, 1985; Miles & Huberman, 1985; Owens, 
1982; Skrtic, 1985), primarily those of a naturalistic bent, have advocated strategies to 
enhance the truth value, applicability, consistency, and neutrality of research. They 
include triangulation, member checks, repeated/protracted observations in natural 
settings, and disciplined introspection (to establish “credibility”/internal validity), 
purposive sampling and thick descriptions (for the sake of “transferability”/external 
validity), and auditable disclosure and independent audit both of the research proce-
dures (“dependability”/reliability) and of interpretations by researchers (“confirm-
ability”/objectivity). These strategies may serve different ends for different research-
ers. For example, as Silverman (1985) observed, Denzin’s (1970, p. 472) defence of
triangulation as a device for corroborating or “adjudicating between accounts” to min- 
imize bias and promote validity involves a use of this strategy for a positivistic intent. 
At the same time, by adopting an interpretive orientation, collection of multiple per- 
spectives can provide a range of individual constructions of circumstances and events 
and draw attention to possible sources of perceptual disparity. In view of the factors 
that affect individual perceptions in diverse ways, current perception theory lends 
support to inclusion of these strategies, where appropriate, in both kinds of research.

Educational researchers also have much to gain from other cautions considered 
and techniques deployed by field research colleagues in anthropology and sociology. 
Not only do ethnographers and participant observers draw on a wide range of cross- 
validating strategies, but they adapt methodological strategies and recheck their de- 
veloping interpretations as research needs demand (Wolcott, 1982, 1985; Van 
Maanen, 1979). As Wax (1971, p. 10) remarked, researchers who find themselves con- 
strained by “method, theory, or technique” need to “slip through the bars and try to 
find out what is really going on”—methodological creativity and versatility are em-
phasized. Anthropologists are also particularly conscious of the limiting and distort-
ing power of their own culturally created values, beliefs, preconceptions,
understandings of reality, and organizing categories. To acquire emic understandings
of inhabitants’ experience, they strive to “place in parentheses” those personal per- 
spectives and “to always doubt the relevance of spontaneous moments of recognition” 
that may spring from their own prior conceptions or misconceptions (McDonnell, 
1979, p. 28). Indeed, prior conceptions are widely recognized as a blinding influence in 
educational and other familiar settings (Wolcott, 1985)—one that Spindler and 
Spindler (1982) found could be remedied only with extended time in the setting. As an 
additional strategy, field researchers’ notes record both direct accounts of informants’ 
perceptions and behavior and—separately—their own reflections and interpretations 
(Pelto, 1970; Spradley, 1979), helping to minimize the impact of the researchers’ per- 
ceptions on their developing understandings and subsequent thick descriptions. 
Personal diaries further allow field researchers to document and explore the influence 
of their own perceptions on the direction and conduct of inquiry throughout the 
course of their investigations. 


Conclusion 

The pursuit of an adequate theory of perception is far from complete. Nevertheless, 
research effort is now being directed toward explanation of individuals’ causal attrib- 
utions in educational and other settings, and researchers are seeking further under- 
standing of the significance and frequency of influence of various factors that 
intervene in the perceptual process. There remains a need for extensive study of the 
nature of perception, important factors, and extent of individual differences in per- 
ceptions in educational settings. Exploration of these avenues should provide greater 
insight than is now available on life in educational organizations. In the meantime, re- 
searchers who rely on their own and others’ perceptions for advancing knowledge 
about aspects of education need to be conscious of the factors that affect those percep- 
tions, and they have an obligation to remind readers that their findings portray reality 
only as it is viewed by themselves and their chosen informants. At the same time, 
there is consolation in the knowledge that perceptions, rather than objective reality, 
direct most attitudes and responses. Hence understanding of social behavior in educa- 
tional settings may continue to be advanced, provided that researchers recognize and 
acknowledge their individual research perspectives, personal preconceptions, and 
preferences, and utilize appropriate research strategies to maintain trustworthiness in 
their inquiry. 

Notes


1. There are two opposing schools of thought on the question of process. Adherents of the traditional, indirect perception 
approach contend that stimuli become meaningful only through a sequence of cognitive activities. Heil (1983), for example, 
distinguished sensory activity from the perceptual task of isolating specific features of stimuli and developing inferences or 
beliefs from these features, and he highlighted the latter as a subsequent, cognitive activity. On the other hand, Gibson’s 
(1979) “ecological view” of “direct perception,” defended by Michaels and Carello (1981, p. 157) and others, rejects the 
“impoverished input” theory and the frames of reference view of perceptual organization. They argue instead that perceivers 
are “active explorer[s] of the environment” seeking out meaningful information which informs them directly without a need 
for internal processing by perceivers; individuals admit meaningful stimuli selectively, and these trigger perceptions 
directly.

2. Limitations of these theoretical concepts should be acknowledged. To Neisser (1976), the selection of stimuli for perceptual 
processing may be a positive, rather than negative, filtering process; rather than using perceptual defences to discard items, 
individuals may actively choose desirable events from among those available. Furthermore, while much of the literature 
affords support for the concepts of constancy and threshold, Forgus and Melamed (1976) advanced rival explanations for 
the research findings on which these notions were founded, and Hochberg (1978) concluded from conflicting research 
reports that the concept of threshold in social perception is both arbitrarily defined and empirically tenuous. 


3. The self-attributional process is widely thought to operate in the same general way as for other persons, although Farr and 
Anderson (1983) disagreed with this view. 


4. Morgan’s (1986) use of the dramatic “psychic prisons” metaphor also reminds us of the power of language tools, vocabulary, 
and communication skills to shape perceptions. 

BOOK REVIEW


SCHOOLS IN THE WEST—ESSAYS IN CANADIAN EDUCATIONAL 
HISTORY. By Nancy Sheehan, J. Donald Wilson, and David C. Jones (Eds.). 
Calgary, AB: Detselig Enterprises, 1986, 323 pp. 


The editors of Schools in the West chose 14 articles for inclusion in this diverse com- 
pendium on education in western Canada from the 1870s to the 1970s. With the ex- 
ception of Brian Titley’s “Indian Industrial Schools in Western Canada” and Nancy 
Sheehan’s “The WCTU and Educational Strategies on the Canadian Prairie” which 
attempt a prairie perspective, each article focuses on aspects of schooling in a single 
western province. Most of the contributors are involved in teacher education, espe- 
cially in the history of Canadian education. The others are either general historians or 
educational administrators. 

The format and intent of the work have much in common with previous Detselig 
editions, including Shaping the Schools of the Canadian West (1979) and Schooling 
and Society in Twentieth Century British Columbia (1980), in which specific and 
often unrelated studies have been brought together under a general geopolitical 
theme. The essays in Schools in the West are grouped under four categories: (a) Cur- 
riculum; (b) Teachers; (c) Ethnicity, Women, Class; (d) Policy-Making and Legisla- 
tion. In the introduction Donald Wilson takes on the task of explaining why these 
topics were chosen and why certain articles were selected. 

In affirming that the selections are representative of the new agenda of Canadian 
educational history advanced at the annual meeting of the Canadian Historical Asso- 
ciation in 1985, namely a focusing “on the actual nature of the educational process it- 
self,” Wilson contends that the articles treat subjects and issues related to this 
process. It soon becomes apparent, however, that the educational process is not a cen- 
tral focus in the multi-authored collection. Although the essays are generally valuable, 
the work as a whole lacks the levels of synthesis and interrelatedness that one would 
hope for. Disparate discussions may well characterize the contents of journals, but 
they are out of place in works designed for undergraduate use. 

In commenting on Harro Van Brummelen’s discussion of the British Protestant 
vision of life held by British Columbia’s anglophone leadership, Wilson refers to a 
theme that pervaded the West; however, this vision is neither identified nor traced by 
most of the other contributors. A regional synthesis is not possible without an overall 
structure in which the parts have been fashioned with such a framework in mind. 
Disparate articles listed under such categories as textbooks and curriculum do not, as 
Wilson claims, lead to a regional synthesis. In fact the articles for the most part sug- 
gest that the region has neither polity nor community. Instead it tends to be depicted 
as an area largely peopled by those who came to an open space to escape history. 

Moreover, the interrelationships between educational process and general 
historical trends should not be enough for the educational historian. A third dimen- 
sion that involves relating analogical examinations of past experiences to present 
circumstances and future possibilities is an integral element of the educational
historian’s craft. An example of how this was not done in Schools in the West can be 
found in Lorne Downey’s discussion on “The Aid to Independent Schools Movement 
in British Columbia.” Wilson states that the “character of relationships, both bureau- 
cratic and political” is addressed in Downey’s chapter. He finds that Downey under- 
lines the value of “detailed description and analysis firmly set in time and place.” 
What Wilson believes to be necessary for good historical work is enunciated when he 
cites the British historian Harold Silver: “first, a concern with close and detailed de- 
scription and analysis, firmly set in time and place, and second a preoccupation with 
continuity and change, with crisis and with transformations” (Silver, 1983, 
pp. 239-240). 

Although Downey gives an outline of events that led to the enactment of legisla- 
tion that provided support for independent schools, he does not situate his data in any 
historical context or overarching theme other than to refer to Jean Barman’s chapter 
on “Transfer, Imposition, or Consensus? The Emergence of Educational Structures in 
Nineteenth Century British Columbia.” But Barman’s discussion, while it provides a 
thorough and valuable background, does not explain the sources and the long-lasting 
character of the resistance to private and religious schooling in British Columbia. Van 
Brummelen’s discussion on “Shifting Perspectives: Early British Columbia Text-
books” is better in this regard, but as his account ends in the mid-1920s, it fails to pro-
vide an analysis of the “continuity and change” so clearly required in a work that
purports to give a regional synthesis. In like manner, while Downey does comment on 
a number of contemporary factors that led to the provision of aid to private schools, 
he does not discuss the implications of this decision on future policy orientations. 

Nancy Sheehan’s initial essay provides a quick overview of formal schooling in Al- 
berta between 1905 and 1980. After touching on three developments: nationalism, 
progressive education, and vocational and technical education, Sheehan concludes 
that the school curriculum has always given priority to matriculation programs. 
Expanding on Sheehan’s comments on the progressive movement, Morris Mott dis- 
cusses the beginnings of physical education in Manitoba in the early nineteenth cen- 
tury. And while both commentators refer to the imperialist orientations of the 
curricula, neither is moved to suggest that its remnants might still inspire much of 
what happens in today’s schools. 

The section on Teachers begins with John Calam’s article on the nature of early 
normal schools in British Columbia and an assessment of their worth by Harold 
Putman and George Weir in 1925. Calam argues that Putman and Weir, as represen-
tatives of the “new education,” rightfully condemned the mediocrity of normal school-
ing. His discussion is quite persuasive, but it raises questions concerning the inferior
status of teachers generally as well as the failure of the reformers to change teaching 
conditions. Robert Patterson in his short but informed discussion of teachers in rural 
Alberta during the depression finds them poorly prepared and generally isolated. 
John Lyons’ essay on the rise of the Saskatchewan Teachers’ Federation in the 1930s 
and early 1940s claims that as teachers adopted militant positions, their goals were 
more readily achieved. But both Patterson’s concluding remarks on modern begin- 
ning teachers and Lyons’ reference to the educational reforms following the CCF 
victory in 1944 leave one groping to find the source of their optimism. 

Most of the conclusions in the section on Ethnicity, Women, and Class are more 
promising. Brian Titley’s essay on Indian industrial schools reveals the deep-seated 
racism of Anglo and Franco Canadians. Titley’s comments come to full flower in his 
recent publication on Duncan Campbell Scott, A Narrow Vision (1986). That myths 
about minorities are hard to put down is further demonstrated by Stella Hryniuk and
Neil McDonald in their excellent article on the positive aspects of Ukrainian school- 
ing in pre-World War I Manitoba and by Clinton White in his carefully documented 
study of German Catholics and schooling in early Saskatchewan. Bill Maciejko pro- 
vides a good account of attempts by working-class radicals to change Winnipeg’s pub- 
lic school system in the era of the General Strike, yet his tendency to see the world 
solely in terms of bourgeois and working-class cultures shows a certain naiveté and be- 
comes somewhat tiresome. Maciejko’s heavy-handedness, however, is a good antidote 
to Sheehan’s article on the Women’s Christian Temperance Union which all but ig- 
nores that organization’s anti-urban, anti-immigrant and anti-Catholic bias. 

The final section on Policy Making and Legislation has already been referred to in 
the comments on Barman’s and Downey’s articles. One cannot leave Barman’s discus- 
sion, however, without challenging her statement that an education consensus existed 
in British Columbia by the late nineteenth century that “was so well-suited ... that it
would endure virtually unaltered for almost a century.” It may be that the die for 
schooling was cast by this time, but that it was “well-suited” is open to question. The 
reasons for the continuing nature of this consensus are not well defined. That it repre- 
sented majority opinion is quite possible, but whether the public school system that 
stemmed from the legislation of 1872 was but one of the available alternatives is not 
examined. 

That British Columbia’s educational system in this century has been complacent 
is manifested in Thomas Fleming’s account of the Province’s School Inspectors. Who 
the “Old Boys in the Field” were is not discussed other than that they came from the 
teaching ranks and that they “were more interested in performing school administra- 
tion than in theorizing about it.” Certainly one of their characteristics was an 
unwavering loyalty to the system. Fleming quotes one inspector to underline the pre- 
vailing attitude: “We were the Department’s men, selected by the Department, and 
responsible to the Department.” The inspectors’ roles were changed following the 
Second World War and while they were to give more time to supervision as opposed to 
inspection, it is far from clear that such a transformation took place. In 1973 when the 
Department gave large districts the right to hire their own superintendents, some as- 
sumed that a transfer of loyalties would occur. Fleming concludes that these changes 
resulted in the Department’s losing “much of its influence in shaping the character of 
school leadership in the Province.” Given the evidence presented, such a conclusion 
does not seem warranted. Fleming’s account, like the other three articles on British 
Columbia, might well have resulted in a different perspective if each contributor had 
access to the other manuscripts. If this had occurred an overall synthesis would have 
been possible. 

Leaving aside the diverse nature of the essays, what is noteworthy about many of 
them is their scholarship and research techniques as well as the insights that have 
evolved from such informed and careful study. David Jones’ article on “Schools and 
Social Disintegration in the Alberta Dry Belt of the Twenties” manifests all these 
characteristics. It is also an excellent example of how social and educational history 
can be brought together. Jones’ use of census data, school board minutes, and other 
primary sources gives a sense of the land seldom found in historical studies. In like 
manner Patterson’s use of oral history techniques provides a background that shapes 
his narrative and situates his observations in contexts that approximate the time and 
place under investigation. Such levels of sensitivity and scholarship augur well for the 
future of the history of education. 

As Wilson points out, however, in this age of policy studies there is a danger that
educational history will become little more than an addendum to whatever issues are 
currently in vogue. It is essential, therefore, that educational historians give more at- 
tention to the interconnectedness of issues and that they attempt to discuss the whole 
story. In this regard, Schools in the West demonstrates the progress that has been 
made and the work that has yet to be done. 

THE EDITOR’S PAGE


A Rule Change is Needed 


A rule in the academic game is that favorable decisions about salary, promotion, and 
tenure follow from publication in refereed journals rather than in nonrefereed 
journals or magazines. Observation of this rule requires the players to make a number 
of assumptions. One is that there is widespread agreement on the definition of a ref- 
ereed journal. Another is that the editorial practices of refereed journals generally ac- 
cord with this definition. Neither of these assumptions is supported by research (e.g., 
Journal of Higher Education, 1984, 55, 673-699). Still another assumption in faculties 
or schools of education seems to be that refereed journals are more influential than 
nonrefereed journals in advancing professional practice. There is little, if any, empiri- 
cal evidence which confirms this assumption. 

To have a more “honest” game—one which is congruent with all of the organiza- 
tion’s goals and which gives each player a better chance to win—we need a rule change 
and a supplement to the rule book.

The rule change is simple. Treat all journal articles (and all other publications for 
that matter) in the same fashion. Assess them on the basis of their probable impact on 
the activities, behavior, or work of others. Both refereed and nonrefereed journals 
operate within similar spheres of influence in the professional and public sectors to 
foster change in education. Each offers ideas and information which indirectly, and 
over the long term, shape our view of reality, our vision of the achievable, our know- 
how, and our commitment to act. 

Judging a person’s articles in terms of their probable influence is not an easy task. 
But it is done all the time—formally and informally, systematically and haphazardly, 
wisely and unwisely. A major impediment is the lack of information about the aims, 
audience, and editorial practices of various journals. 

Thus, to aid in the implementation of the proposed rule change, we need to 
supplement the rule book with a new directory of Canadian publications in education. 
The existing list of education publications included annually in the CEA Handbook 
provides an excellent skeleton framework around which to build this more compre- 
hensive guide. The directory would incorporate information about the purpose, read- 
ership, and circulation of each journal or magazine. Publication of the directory might 
be a joint venture of the Canadian Education Association and the Canadian Society 
for the Study of Education. Since the directory would also assist prospective authors 
and facilitate communication, some financial support might be obtained from the So- 
cial Sciences and Humanities Research Council. 

A rule change in hockey or football is often made for the benefit of the fans. 
Similarly, this rule change ought to be beneficial for our “fans.” 


WHW 

REX G. SHARMAN


Carleton Board of Education 


Organizational Supports for Implementing 
an Instructional Innovation 


This empirical study investigated claims that implementation of educational innova- 
tions by teachers is enhanced by active support through evaluation, supervision, and 
staff development. The study measured implementation of a new mathematics pro- 
gram by 155 elementary teachers in Eugene, Oregon. Data were gathered through a new 
questionnaire. 


The dependent variable comprised two dimensions: teacher level of use and fidelity of 
implementation. The independent variables were composed of the frequency and 
usefulness of support received in assisting implementation. Eight plausible rival 
hypotheses for implementation were examined; five were significantly related to 
implementation. The data were analyzed at two levels: across teachers and across schools. 


All hypotheses were supported at the school level of analysis, and two were supported at 
the teacher level. This study revealed significant relationships between the amount of 
organizational support to assist implementation and the degree to which the 
innovation had been implemented. 


Arends (1982) wrote that during the past decade a small but consistent literature 
evolved that spoke to the importance of administrative support for change efforts in 
educational organizations. He cited reports by researchers that appeared to contain a 
clear message: with active support from principals or deans, change efforts succeed; 
without such support, they fail. He claimed the meaning of active support was not 
consistent in those reports, but he found some behaviors that facilitated change 
efforts in schools or colleges. Those behaviors supporting change were: (a) promoting 
the interests of the project with verbal statements; (b) providing project participants 
with a sense of clarity and a measure of steadiness; (c) defending project goals and 
activities from dissenting voices and opposition; and (d) giving something of value to 
organizational members. Similar characteristics of support were discussed by 
Sergiovanni (1984) who emphasized that symbolic and cultural leadership signalled to 
others what was important and valuable in an organization. 


Dr. Sharman is presently a high school head of administration in Nepean, Ontario, and a 
part-time professor of educational administration at the University of Ottawa. 


Active support for change efforts in educational institutions is discussed in 
another body of literature, but that literature emphasizes support for change through 
evaluation, supervision, and staff development. Alfonso (1984) stated that any serious 
effort to improve teaching or schools required specific programs of evaluation, 
supervision, and staff development. Fullan and Pomfret (1977) argued that support 
was needed to help teachers overcome problems encountered during implementation. 
Teachers required direct assistance from administrators or colleagues to overcome 
problems; receiving only verbal encouragement or symbolic leadership was not 
enough to ensure implementation of new programs. Joyce and Showers (1980), 
Natriello (1984), Neale, Bailey, and Ross (1981), and Showers (1985) all reported that 
support through evaluation, supervision (coaching), and staff development was essen- 
tial for instructional improvement practices to become institutionalized in schools 
and classrooms. Leithwood and Montgomery (1982) analyzed research about imple- 
mentation that revealed only 50% of elementary principals actually attempted to as- 
sist teachers in improving instructional programs, and they implied that instructional 
improvement would be enhanced if elementary principals actively supported or as- 
sisted their teachers in that task. These research reports, however, contained little 
empirical support for the claims of their authors. 

Instructional improvement efforts and program implementation are often closely 
associated. Although programs must ultimately be implemented in the classroom, 
“the quality of the process ... is enhanced or diminished by the quality of activity at 
the level above it” (Purkey & Smith, 1982, p. 64). Support provided to or received by 
teachers is likely a function or characteristic of the organizational context and not of 
individual teachers. Supports operate at the school level and can influence implemen- 
tation by individual teachers or a whole staff. Schools, as well as teachers, may differ 
on how effectively they implement changes; the quality of systems of support might 
aid in differentiating levels of school effectiveness. 

The purpose of the study was to determine whether implementation was influ- 
enced by differing degrees of support received by teachers. Empirical evidence was 
sought to substantiate the claims of the previously-mentioned authors that active 
support for teachers enhanced implementation; such evidence was generally lacking. 
In this study, support was limited to assistance received through evaluation, 
supervision, and staff development—the three dimensions advocated by Alfonso 
(1984). Evidence that support for teachers enhanced implementation could benefit 
educational systems; those supports could be included in implementation plans or 
policies to improve outcomes. 


Method 
This verificational study was developed as a correlational design. The literature 
suggested that a direct relationship should exist between support and implementation. 
Three hypotheses were advanced as predictions for testing: 
H1 The more evaluation supports the implementation of the innovation, the greater 
the degree of implementation that will result. 
H2 The more supervision supports the implementation of the innovation, the greater 
the degree of implementation that will result. 
H3 The more staff development supports the implementation of the innovation, the 
greater the degree of implementation that will result. 


In addition, the relationship between the presence of all supports and the degree 
of implementation was examined. 
The study was conducted in a public school district in Eugene, Oregon. The 
innovation chosen to carry the dependent variable was an elementary school mathematics 
program called Real Math. In 1982, Real Math was one of many commercially-pre- 
pared programs considered to support a revised scope and sequence document in 
mathematics. Real Math was selected in the spring of 1983 to support the document, 
and implementation began in the fall of 1983; it was the only program of mathematics 
books and materials to be used in elementary schools. The study was conducted in the 
fall of 1985. 

The dependent variable, implementation, was defined as the level of use of the in- 
novation and the closeness of fit or the fidelity with which the innovation as imple- 
mented matched system requirements. Those two dimensions of implementation 
were derived from the Concerns-Based Adoption Model of implementation of Hall, 
Loucks, and others. Implementation was measured by combining (a) the measure of 
the closeness of fit between the teacher’s actual use of the innovative program and the 
ideal use specified by the district (fidelity), and (b) the measure of the teacher’s level 
of effective use (level of use). Those two dimensions were subscores that were com- 
bined as the measure of implementation. Fidelity was measured using an innovation 
configuration checklist developed according to the method of Heck, Stiegelbauer, 
Hall, and Loucks (1981). Teacher level of use of Real Math was measured through 
behaviorally-anchored rating items developed from definitions of levels of effective 
use of an innovation (Loucks, Newlove, & Hall, 1976). 

Organizational support for implementation of Real Math was limited to evaluation, 
supervision, and staff development, and those supports could come from several sources. 
Evaluation or supervision could be received from the principal, colleagues, or district 
office personnel. Staff development, however, was usually a school function or 
characteristic that included formal activities planned or supported by administrators. 
Weighted frequency scores (indexes that additively combined the frequency of 
support received with the measure of usefulness of that support) were calculated for 
evaluation and supervisory support received from principal, colleagues, and district 
personnel (six scores in total). Support through staff development was measured by 
combining a measure of principal support through staff development and usefulness 
of the program(s). The independent variable scores for evaluation and supervision 
combined the weighted frequency scores for that support received from the principal, 
colleagues, and district personnel. The score for staff development combined the 
weighted score for staff development with teacher assessments of the usefulness of 
district in-service programs. 
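
As a rough illustration of how such an additive index might be built, the sketch below assumes each teacher rates every source of support on a frequency scale and a usefulness scale. The rating ranges, the function names, and the sample ratings are hypothetical; the article does not report the scales or any weighting constants.

```python
# Hypothetical sketch of an additive "weighted frequency" index.
# Scales and sample ratings are illustrative assumptions, not the study's data.
SOURCES = ("principal", "colleagues", "district")


def weighted_frequency(frequency, usefulness):
    """Additively combine how often support was received with how useful it was."""
    return frequency + usefulness


def support_score(ratings):
    """Sum the weighted frequency scores over all sources for one type of support."""
    return sum(weighted_frequency(*ratings[source]) for source in SOURCES)


# One teacher's (frequency, usefulness) ratings for evaluation support.
evaluation_ratings = {"principal": (3, 4), "colleagues": (1, 2), "district": (0, 1)}
print(support_score(evaluation_ratings))  # 11
```

The same function could be applied to a teacher's supervision ratings to obtain the second independent variable score.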

Implementation, however, is very complex. According to Fullan and Pomfret 
(1977), “given the complexity of ... implementation, the factors that can plausibly 
influence it are potentially enormous in number” (p. 367). Those authors found a 
number of variables that influenced implementation, and they asserted that any study of 
implementation should, of necessity, investigate a wide range of variables that could 
influence it. The study investigated the following plausible rival hypotheses for 
implementation: 


1. Age: Rogers (1962) claimed that early adopters were younger in age than late 
adopters and implied an inverse relationship between adopters’ age and degree of 
implementation. Plausibly, psychological characteristics associated with age, 
rather than organizational support, could influence implementation. 


2. Teaching experience might also influence implementation rates, and an inverse 
relationship was expected. More experienced teachers would be less innovative 
than less experienced teachers. 


3. Problem-solving utility: Rogers (1962), and Berman and McLaughlin (1976), em- 
phasized that if adopters believed an innovation would solve existing problems 
they would be more likely to adopt the innovation than if it was simply added on; 
the more useful, the greater the implementation—a direct relationship. Teacher 
expectations concerning Real Math’s usefulness to their teaching of mathematics 
were measured. 


4. Explicitness: Fullan and Pomfret (1977) claimed that teachers were often unable 
to describe the essential features of an innovation they were supposed to be using. 
Low implementation could result from confusion rather than from lack of organi- 
zational support. The more teachers were confused about the innovation, the less 
implementation would be expected. Teacher confusion about the purposes of Real 
Math was measured. 


5. Complexity: Fullan and Pomfret (1977) wrote that innovations requiring complex 
changes in teacher behavior were difficult to implement satisfactorily. The more 
complex the innovation, the less the implementation. Teacher expectations con- 
cerning complexity of Real Math were measured. 


6. Compulsion: Fullan and Pomfret (1977) wrote that few studies had investigated whether 
voluntary or compulsory participation in programs affected implementation. Although use 
of Real Math was compulsory among teachers, participants’ attitudes toward the 
compulsory use of Real Math were measured; did attitudes toward compulsory 
participation relate to differing degrees of implementation? 


A 55-item questionnaire was developed to gather data. The questionnaire was lim- 
ited in length by district personnel who felt that a longer document would discourage 
teachers from responding. Items to measure four factors of fidelity of implementation 
were developed cooperatively with the District’s teacher on special assignment for 
mathematics; she knew how successful implementation would be identified. The 
behaviorally-anchored rating items to differentiate four levels of use were developed 
by the author, and they were pretested on a small group of teachers from another 
board. The dependent variable was measured using 32 items; 14 measured level of use 
and the remainder measured fidelity of implementation. The independent variables 
were measured using 14 items; five of those measured frequency of support and the re- 
mainder measured the usefulness of those supports in helping teachers implement 
Real Math. Eight aspects of the plausible rival hypotheses were measured. 

The two dimensions of the dependent variable were measured as summated scales 
combining Likert-type items: 

1. The 18 items comprising the fidelity dimension were, with recoding, scaled in the 
same direction and were additive; the greater the total fidelity score, the greater 
the degree of implementation. The questions concerning fidelity asked teachers 
either: (a) how often they used specified components of Real Math; or (b) to rate, 
on a five-point scale ranging from “always true of me” (5) to “never true of me” (1), 
how they applied specified behaviors associated with Real Math. Teachers whose 
use of Real Math matched the ideal described by the district would score 90, and 
teachers whose use departed most from the ideal would score 18 (if they responded 
to all items). The mean of the item responses was used to measure fidelity of im- 
plementation. 

2. The level of use subscore contained 14 items measured on scales of 1 to 5. With 
recoding, the level of use items were additive; the greater the total score, the more 
one knew about or was doing to implement the innovation. Like fidelity, level of 
use was calculated as the mean of the item scores. 
The degree of implementation of Real Math was obtained by averaging the mean 
of items for fidelity and the mean of items for level of use to yield a maximum possible 
implementation score of five. That method of scaling allowed each dimension to be 
weighted equally in the dependent variable score. 
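
The scoring just described can be sketched as follows. This is a minimal illustration, not the author's scoring program: the function names are hypothetical, and treating a blank item as `None` and skipping it is an assumption, since the article does not say how partially completed scales were handled.

```python
from statistics import mean

SCALE_MIN, SCALE_MAX = 1, 5  # every item is answered on a 1-to-5 scale


def recode(response, reverse=False):
    """Flip a reverse-worded item so that all items point in the same direction."""
    return SCALE_MIN + SCALE_MAX - response if reverse else response


def subscore(responses):
    """Mean of the answered items; None marks an item left blank (assumption)."""
    answered = [r for r in responses if r is not None]
    return mean(answered)


def implementation_score(fidelity_items, level_of_use_items):
    """Average the two subscores so that each dimension is weighted equally."""
    return (subscore(fidelity_items) + subscore(level_of_use_items)) / 2


fidelity = [5] * 18      # a teacher matching the district ideal on all 18 items
level_of_use = [3] * 14  # mid-scale answers on all 14 level of use items
print(recode(2, reverse=True))                       # 4
print(sum(fidelity))                                 # 90
print(implementation_score(fidelity, level_of_use))  # 4.0
```

Because each subscore is a mean on the 1-to-5 scale, the 18 fidelity items do not outweigh the 14 level of use items in the combined score.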

The independent variables were measured as weighted frequency scores (as 
described earlier) that combined quality and quantity of support received. The 
plausible rival hypotheses, with the exception of years of teaching experience, were 
measured as discrete variables. 

The unit of observation was the teacher. Data from teachers in each school were 
aggregated to describe the nature of support and the level of implementation in their 
school. The school district wanted as much data as possible from the research, partic- 
ularly data describing fidelity of implementation. A complete census of elementary 
school teachers of mathematics of grades 1 through 5 was undertaken, and the study 
population was 283 teachers. Questionnaires were hand-delivered in bulk by the au- 
thor to the principal of each elementary school; the principal was responsible for dis- 
tributing and collecting them from the teachers. The study was conducted in early 
December 1985, and the timing—at the end of a long term—might have contributed 
to a low completion or return rate. District policy or precedent allowed teachers to de- 
cide whether they would participate in the study, and principals could decide—either 
unilaterally or collectively with their staff—whether a school as a unit would partici- 
pate. All but two of the 31 schools participated. 

The achieved sample in the study was 155 teachers, a participation rate of 54.8%. 
Six questionnaires contained so much missing data that they were excluded from the 
analysis. The achieved sample of schools or organizational units was 22. That N=22 
resulted from: (a) combining data sets where two organizational units were 
administered by one principal and one of the units returned so few questionnaires that 
analyzing that unit separately would have had limited value (2 cases), and (b) eliminating 
schools where fewer than 30% of the staff responded to the questionnaire (5 cases). In 
the latter instances, so small a percentage of responses from a staff could bias a school’s 
score and the subsequent analyses by school. 
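
The aggregation and the 30% response rule can be sketched as below. The school names, staff sizes, and scores are hypothetical; only the 30% threshold and the 155-of-283 participation figures come from the text.

```python
from statistics import mean

MIN_RESPONSE_RATE = 0.30  # schools with fewer than 30% of staff responding are dropped


def school_means(teacher_scores, staff_sizes):
    """Return {school: mean score} for schools that meet the response threshold.

    teacher_scores maps school -> list of teacher implementation scores;
    staff_sizes maps school -> number of mathematics teachers on staff.
    """
    kept = {}
    for school, scores in teacher_scores.items():
        if len(scores) / staff_sizes[school] >= MIN_RESPONSE_RATE:
            kept[school] = mean(scores)
    return kept


# Hypothetical data: School B has a 10% response rate and is excluded.
scores = {"School A": [3.0, 3.5, 4.0], "School B": [2.5]}
staff = {"School A": 6, "School B": 10}
print(school_means(scores, staff))  # {'School A': 3.5}

# Participation rate reported in the text: 155 of 283 teachers.
print(round(155 / 283 * 100, 1))    # 54.8
```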


Results 

Table 1 summarizes fidelity, level of use, and degree of implementation scores associ- 
ated with measuring the dependent variable. Scores on each scale could range from 1 
through 5. A high score indicated either (a) ideal program implementation by a teach- 
er or school, (b) routine or integrated use of Real Math by teachers or in schools, or 
(c) a high degree of implementation as shown by the combined subscores. A low score 
indicated either (a) less than ideal implementation, (b) that teachers were experienc- 
ing difficulty or implementing only parts of the program, or (c) a low degree of imple- 
mentation as shown by the combined subscores. 

The data show a wide range of fidelity of implementation and levels of use across 
teachers and schools. The narrower ranges across schools reflect the effects of 
averaging teacher data. 

The relationship between the two subscores that were combined to measure degree 
of implementation was assessed. The correlation between those two subscores was 
r=.269 (teacher level, N=149) and r=.467 (school level, N=27). Those relationships 
support measuring the degree of implementation of an innovation through the 
dimensions of level of use and fidelity, although it is possible that two different 
factors of implementation were being measured. 
TABLE 1 


Fidelity, Level of Use, and Implementation Scores 


                                                           Range 
Dimension        Unit of Analysis   Mean Score (SD)        Low           High 
Fidelity         Teacher (N=149)    [illegible]            [illegible]   [illegible] 
                 School (N=22)      [illegible]            [illegible]   [illegible] 
Level of Use     Teacher (N=149)    [illegible] (0.75)     1.62          [illegible] 
                 School (N=22)      [illegible] (0.24)     [illegible]   [illegible] 
Implementation   Teacher (N=149)    3.23 (0.39)            [illegible]   [illegible] 
                 School (N=22)      [illegible] (0.23)     [illegible]   [illegible] 


The relationship between each plausible rival hypothesis and degree of 
implementation was examined to determine if a significant relationship existed. Data 
from those variables had been coded to reflect the relationship suggested in the source 
literature: (a) the greater the clarity (the less the confusion), the greater the 
implementation, so a direct relationship was expected; (b) the greater the 
problem-solving utility, the greater the implementation, so a direct relationship was 
expected; (c) the less favorable the attitude resulting from compulsory participation, 
the less the implementation, so a direct relationship was expected between attitude 
toward compulsion and implementation; (d) the less difficulty in using an innovation, 
the greater the implementation, so a direct relationship was expected between ease of 
using or adapting to Real Math and implementation; (e) an inverse relationship was 
expected between age and degree of implementation; and (f) an inverse relationship was 
expected between experience and implementation. 

Two plausible rival hypotheses were significantly related to the dependent 
variable in the analysis by teachers (the teacher as the unit of analysis). They were: 
(a) clarity about Real Math (when implementation occurred), and (b) the 
problem-solving utility of Real Math. It was plausible that significant relationships 
between the independent variables and degree of implementation might result from the 
effects of clarity or utility and not from support for implementation, so those 
significant rival hypotheses were used as statistical controls during the hypothesis tests. 

Many teachers reported they were clear about most or all aspects of Real Math, 
which might indicate that the goals of the program had been adequately communicated 
to them, that the program matched teachers’ past experience, or that the program was 
philosophically acceptable to them. These teachers implemented better than those 
who were more confused. Almost two-thirds of the sample believed Real Math had 
little utility for them, and their mean implementation scores were much lower than 
those of teachers who believed the program would reduce problems in teaching mathematics. 

Teacher data were aggregated to the school level by obtaining school means for the 
variables. Five plausible rival hypotheses were significantly related to the dependent 
variable in the analysis by school. Two of them, clarity and utility, had been 
significant in the analysis by teachers; compulsion, difficulty or complexity of Real 
Math, and age were also significant across schools. 

Fullan and Pomfret’s (1977) claim that compulsion, or attitudes resulting from 
compulsion, might plausibly explain implementation was supported at the school 
level. Similarly, their claim that the complexity or expected difficulty of the 
innovation might influence implementation was supported. The relationship between age 
and implementation was surprising; although significant, it failed to support Rogers’ 
(1962) claim because a direct relationship was found. Rogers had predicted an inverse 
relationship. Perhaps greater experience with a variety of mathematics programs 
meant that older teachers were more adaptable; that is, they already possessed skills 
required in the new program. 


Testing the Hypotheses 

The hypotheses were tested through multiple regression in which the significant rival 
hypotheses were used as control variables. Testing was conducted across teachers and 
across schools. The control variables were entered as a block into a regression 
equation, and then the independent variable (or its components) was entered as a separate 
block. The strength of the R² change resulting from entering the independent variable 
showed whether a significant relationship existed. Summary data of those analyses 
are presented in Table 2 (across teachers) and Table 3 (across schools). 
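
A minimal sketch of the incremental F test usually applied to an R² change in blockwise regression follows. The article does not name the test it used, so the formula is an assumption, and the function names are hypothetical. The inputs are the evaluation figures reported for the teacher-level analysis (R² = .217, R² change = .106, df = 5/117, with three evaluation components entered after two control variables).

```python
def f_for_r2_change(r2_full, r2_change, added_predictors, residual_df):
    """Incremental F for a block of predictors added to a regression equation."""
    return (r2_change / added_predictors) / ((1 - r2_full) / residual_df)


def overall_f(r2_full, predictors, residual_df):
    """Overall F for the full equation (controls plus the added block)."""
    return (r2_full / predictors) / ((1 - r2_full) / residual_df)


# Evaluation block, teacher level: 2 controls + 3 evaluation components.
print(round(f_for_r2_change(0.217, 0.106, 3, 117), 2))  # 5.28
print(round(overall_f(0.217, 5, 117), 2))               # 6.49
```

The overall F computed from the rounded R² comes out close to the 6.499 reported for that block, which suggests the reported values are internally consistent.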

Table 2 reveals that H1 and H3 were supported across teachers when the effects of 
the significant plausible rival hypotheses were controlled. The positive beta weights of 
the principal components indicated a direct relationship; the greater the support 
received through evaluation or staff development, the greater the degree of 
implementation. H2, however, was not supported. 

Table 3 shows that all hypotheses were supported across schools when the effects 
of the five significant rival hypotheses were controlled. Again, the positive beta weight 
for each independent variable indicated a direct relationship; the greater the support 
received, the greater the degree of implementation. 

The relationship between support received from all sources and implementation 
was examined to find out how much stronger the relationship became when all sources 
were present. In the analysis across teachers, the presence of all supports accounted 
for only 3.3% more of the variance in implementation than was accounted for by 
the strongest single support (R² of evaluation=.217 [Table 2], R² of multiple 
supports=.250). Across schools, those multiple supports accounted for an additional 
6.6% of the variance on the dependent variable (R² of support through staff 
development=.625 [Table 3], R² of multiple supports=.683). 


Post Hoc Analyses 

One-way analyses of variance using individual teacher data were run between schools 
and each type of support (the components of the independent variables in Table 2). 
Those analyses allowed testing of the reliability of teacher responses through the 
formula r=(F-1)/F (Winer, 1971, p. 287). There was moderate agreement among 
teachers about the value of evaluation support from principals (r=.41), but reliability 
coefficients of .00 were obtained concerning colleague or district evaluation. Concerning 
supervision, there was moderate agreement among teachers about the value of supervisory 
support from the principal (r=.45), but there was less agreement about the value of 
support from colleagues (r=.28) or district personnel (r=.34). 
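
The reliability formula can be illustrated with a small one-way analysis of variance in which schools are the groups. The ratings below are hypothetical, and reporting a coefficient of .00 whenever F falls below 1 is an assumption made here to match the .00 values mentioned in the text.

```python
from statistics import mean


def one_way_f(groups):
    """F ratio from a one-way ANOVA; each group holds one school's teacher ratings."""
    n_total = sum(len(g) for g in groups)
    grand_mean = sum(sum(g) for g in groups) / n_total
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)
    ms_between = ss_between / (len(groups) - 1)
    ms_within = ss_within / (n_total - len(groups))
    return ms_between / ms_within


def reliability_from_f(f_ratio):
    """r = (F - 1) / F, floored at zero when F is below 1 (assumption)."""
    return max(0.0, (f_ratio - 1) / f_ratio)


# Hypothetical usefulness ratings from three schools of three teachers each.
schools = [[4, 5, 4], [2, 3, 2], [3, 3, 4]]
f_ratio = one_way_f(schools)
print(round(f_ratio, 2))                      # 9.0
print(round(reliability_from_f(f_ratio), 2))  # 0.89
```

When teachers within a school agree closely and schools differ from one another, F is large and r approaches 1; when ratings vary as much within schools as between them, F is near 1 and r is near zero.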

Factor analysis was performed on the fidelity and level of use dimensions of the 
dependent variable to check the validity of those portions of the questionnaire. The 
items measuring fidelity loaded perfectly on four factors; four factors had been inves- 
tigated. The items measuring levels of use loaded on five factors; four levels had been 
investigated. The items measuring non-use and routine use loaded perfectly, but 
those measuring preparatory use or integrated use did not clearly differentiate the 
subjects. 
TABLE 2 


Summary Data: Hypothesis Testing Across Teachers 


Variable or Component            Beta          R²      R² Change 

Control Variables 
  Clarity at Adoption            .188 
  Problem-Solving Utility        .261          .111 

Independent Variables 

Evaluation 
  Control Variables (above)                    .111 
  Principal Evaluation           .170 
  Colleague Evaluation           .265 
  District Evaluation            [illegible]   .217    .106* 
  Overall F=6.499  df=5/117 

Supervision 
  Control Variables (above)                    .111 
  Principal Supervision          [illegible] 
  Colleague Supervision          -.044 
  District Supervision           .116          .143    .032 
  Overall F=3.916  df=5/117 

Staff Development 
  Control Variables (above)                    .111 
  Principal and Staff Dev.       .254 
  District In Service            [illegible]   .167    .056* 
  Overall F=5.946  df=4/118 

*p<.05 


TABLE 3 


Summary Data: Hypothesis Testing Across Schools 


Variable or Component            Beta          R²      R² Change 

Control Variables 
  Age                            [illegible] 
  Clarity at Adoption            .081 
  Complexity                     .114 
  Problem-Solving Utility        [illegible] 
  Compulsion                     .064          .430 

Independent Variables 
  Control Variables (above)                    .430 
  Evaluation                     .402          .569    .138* 
  Overall F=3.298  df=6/15 

  Control Variables (above)                    .430 
  Supervision                    .593          .604    .174* 
  Overall F=3.816  df=6/15 

  Control Variables (above)                    .430 
  Staff Development              .562          .625    .195* 
  Overall F=4.174  df=6/15 

*p<.05 
Conclusions 

Controlling for the significant plausible rival hypotheses, all three hypotheses were 
supported across schools, and two of them were supported across teachers. The data 
therefore support the notion that the more evaluation, supervision, and staff 
development support the implementation of an innovation, the greater the degree of 
implementation. 

Analyzing the data at two levels instead of just at the teacher level, which Sirotnik 
and Burstein (1985) claim is “the one most often used and least often, if ever, relevant” 
(p. 171), was important. Grouping teacher data by school and then analyzing across 
schools produced stronger relationships than analyzing across teachers. Those 
analyses also took into account the organizational units in which support and 
implementation occurred; analyzing data across teachers could wash out differences that 
resulted from differences in organizational context. The stronger relationships 
reinforced the notion that support for implementation and the degree of implementation 
reflect the context in which implementation occurred. Furthermore, had the data 
been analyzed at one level (that is, only across teachers), the significance of support 
through supervision as a school variable would not have been established. 

The inclusion in the study of a number of threats to implementation as plausible 
rival hypotheses was supported, and the significant relationships between them and 
implementation supported work by Berman and McLaughlin (1976), Fullan and 
Pomfret (1977), and Rogers (1962). Furthermore, the use of significant variables as 
statistical controls during hypothesis testing demonstrated the ceteris paribus im- 
plied in each hypothesis. 

The data were interpreted to support the hypothesis: more support leads to more 
implementation. There is a major difficulty in attributing causation in correlational 
studies of this type because a reverse direction of causation is plausible. Teachers who 
implemented the innovation well could have sought and received much support, and 
those teachers could have been more comfortable than low implementors in accepting 
help. Implementation could lead to acceptance of support. 

This study was a “one-shot case study” (Campbell & Stanley, 1966, p. 6); all 
variables were measured at the same time. The number of items in the questionnaire was 
restricted, and respondents had to rely on memory to report events that occurred two 
years before the study was done. Future research could be strengthened by 
longitudinal designs that would allow the following: 

1. Measure the plausible rival hypotheses for implementation at a different time 
from the independent and dependent variables. Clarity, problem-solving utility, 
complexity, attitude resulting from compulsion, or other plausible rival 
hypotheses could be measured when implementation begins. In that way memory, and 
perhaps the benefits or frustrations experienced during implementation, would not 
color teacher responses. Rival hypotheses could be measured again when the 
dependent variable is measured. 


2. Measure the frequency and helpfulness of support for implementation each year 
or half year. It would also be possible to have principals, teachers, and district 
personnel report the frequency with which they provided help and whom they helped. 

3. Measure the degree of implementation at a number of points in time. 

4. Observe the process of implementation, and use methods of measurement other than 
just a questionnaire. 

This verificational study supports the notion that the more organizational support 
teachers receive to assist with implementation, the greater the degree of 
implementation. The study points to the important role that principals play in 
instructional improvement activities or implementing new programs. Principal support 
through evaluation and creating conditions favorable for implementation through 
supervision and staff development were significantly related to implementation, 
thereby supporting Arends (1982) and Alfonso (1984). 

An important implication is that principals should be the initial focus of efforts to 
implement an innovation. Principals must become convinced that the new program is 
useful and desirable. Without their support, it seems the chances of implementation 
are not as good as when principals are supportive. Principals should understand the 
innovation; they are the first significant line of support for teachers during the imple- 
mentation, and they should understand the links between stages or grade levels with- 
in a program. Principals can create conditions in schools that assist implementation, 
such as fostering staff development activities and peer supervision or coaching as 
school norms. In short, the characteristics of effective principals described in 
Leithwood and Montgomery (1982) should be developed for every school program. 
Principals must implement the innovation first. 
Critical Incidents in the 
Supervision of Student Teaching 
in an Extended Practicum 


A number of Canadian teacher education programs have been revised to include a con- 
tinuous longer period of student teaching. Throughout such an extended practicum, 
the subjects of this study submitted critical incidents in the supervision of their student 
teaching. The incidents were categorized according to an amended version of Broudy’s 
(1965) problems of practice comprising the education profession and the categorization 
compared with that obtained in a distributed practicum setting. Extended practicum 
student teachers, like their distributed practicum counterparts, submitted incidents 
unevenly across the categories. Also, they submitted significantly fewer
management-organization type incidents and significantly more interpersonal incidents
than distributed practicum student teachers, although the interpersonal incidents they
did submit were less likely to be negative.
Among secondary extended practicum student teachers there was a tendency to submit 
more management-organization, more teaching-learning and fewer interpersonal critical
incidents than elementary extended practicum student teachers.


This research was undertaken to study critical incidents in the supervision of student 
teachers engaged in an extended practicum or internship experience. It followed an 
earlier phase of the same project (Housego & Boldt, 1985) which found such incidents 
reported by student teachers, sponsor teachers, and faculty supervisors engaged in a 
series of shorter school experiences, a distributed practicum, to be very unevenly 
spread over a set of categories based on Broudy’s (1965) problems of practice in the 
education profession. The Broudy categories—aims and objectives, organization- 
management, curriculum, teaching-learning, and professional content—were amend- 
ed through the addition of a sixth category encompassing incidents focused on the in- 
terpersonal relationship of the student teacher and his or her supervisors. 

According to Broudy (1965), the problems of practice which he describes establish 
education as a unique profession. That being the case, one would expect these prob- 
lems to be at the heart of the professional education of teachers and to constitute the 
major substantive content of their student teaching experience. Various reasons for 
the abundance of incidents in some categories and the absence or near absence of 
incidents in others—the nonrealization of these expectations in the distributed
practicum study—were advanced (Housego & Boldt, 1985, pp. 120-121). The greater than
expected number of incidents in the management-organization and in the interper- 
sonal categories was particularly noteworthy. These emphases, it seemed, might be 
linked to the marginal nature of the student teacher’s position (Clifton, 1979, p. 63) 
and to the often pressured and stressful relationship of the student teacher with his or 
her supervisors. The student teacher tries to please both the sponsor teacher and the 
university supervisor and, in addition, demonstrate a high degree of teaching compe- 
tence, all within a relatively short time span. 

In an extended practicum, the time limitations of the shorter distributed practi- 
cum are much reduced, if not eliminated. With additional practice time, one might ex- 
pect a student teacher to become much more adept at handling management and 
organizational problems in the classroom. In addition, his or her interpersonal rela- 
tionships with supervisors, particularly the sponsor teacher, could be improved, be- 
cause there would be more time to clarify the role expectations held for student 
teachers. Even the pressures of evaluation could be somewhat lessened in an extended 
practicum with more time for the student teacher to respond to feedback from his or 
her supervisors, more time to demonstrate teaching competence, and more evidence 
on which to base important evaluative decisions. 

These speculations as to the possible effects of extending the length of time spent 
in classroom practice prompted the planning of a study of the distribution of critical 
incidents in the supervision of student teaching over the amended Broudy categories 
during a longer period of practice in schools, a time span of months rather than weeks. 
In an extended practicum, would critical incidents in the supervision of student 
teaching be spread more evenly across the Broudy categories? Would negative inter- 
personal relations critical incidents in the supervision of student teaching be fewer in 
number? 


Theoretical Perspectives 

Student teaching is considered to be an important part of teacher education, an occa- 
sion when theory and practice are melded. Ideally, it is a time when student teachers 
learn to criticize their own classroom performances in the light of both the best craft 
rules available and research findings on teaching. Although research on supervision, 
particularly studies based on data provided by the participants in the supervisory 
process, was once limited (Zeichner, 1980, p. 46), this is no longer the case; in fact, one 
might say there is a substantial beginning research effort directed toward understand- 
ing the supervisory process and its effects within the larger context of the socialization 
of student teachers (Lortie, 1975; Zeichner & Grant, 1981). 

Apart from studies of the effects of school experience on student teacher attitudes 
and control ideologies (Tabachnick & Zeichner, 1984; Tardif, 1985; Zeichner & Grant, 
1981), a number of researchers (Fuller & Bown, 1975; Gibson, 1976; Tardif, 1985) have
documented, in those learning to teach, a developmental sequence beginning with 
survival concerns, progressing to teaching situation concerns, and finally moving to 
concerns for pupil needs. It is thought that student teachers, particularly in a brief 
school experience, may be unlikely to progress beyond the first stage of survival con- 
cerns, unless they are given more opportunity to master the survival and technical 
type of problems of the first two stages and are stimulated to become more reflective 
about their teaching. Supervisors of student teachers face the task of helping student
teachers scale this developmental ladder, at least to the point of successfully dispatching
the survival concerns to which attention must be given before it is directed to
lesson presentation and individual pupil needs.

Some attempt to address this problem has been made through exploring various 
models of supervision. Some models are based on analysis of the conceptual level and 
content of the dialogue of the postconference (Blumberg, 1977; Grimmett, 1981; 
Hunt, 1977-1978; Sprinthall & Thies-Sprinthall, 1983; Zeichner & Liston, 1985). 
Other models describe supervisory behavior and categorize various approaches to 
supervision (Glickman, 1981; Horwood, 1981; Housego & Grimmett, 1983; Olson, 
1982). It is difficult to know which supervisory approaches are most effective and 
under which circumstances, and although supervisees’ preferences for particular 
styles of supervision (Blumberg, 1977; Copeland, 1982; Nias, 1977) provide some guid- 
ance, more information about the kinds of supervisory interventions which are in fact 
made in various types of field experiences is required. In addition, it would be helpful 
to know how student teachers perceive supervisory interventions. What is helpful to 
them and what is not? The critical incidents technique has been used to study 
supervisory behaviors in several contexts—faculty supervision of high school house- 
hold economics student teachers (Dirks, 1967), the supervision of student nurses in 
nursing practicum (Drachelet, 1981), and more recently the critical behavioral re- 
quirements of cooperating teachers (Copas, 1984). It is used in this study to collect in- 
formation about the supervision experiences of student teachers in an extended 
practicum in order to permit comparison with the supervision experiences of student 
teachers in the previously described distributed practicum. 


Methods and Data Sources 

The critical incident technique (Flanagan, 1954) is designed to collect an extensive 
range of incidents from those who are in a position to report what facilitated or 
hindered the aim of an activity. In this study, the activity of ultimate concern was the 
teaching performance of student teachers. Since student teachers themselves are per- 
haps in the best position to make judgments as to the effects of various events on their 
teaching performances, their reports of critical incidents in the supervision of their 
student teaching were sought. In this study, a critical incident in the supervision of 
student teaching is a supervisory incident identified as having had an impact, positive 
or negative, on the activity of ultimate concern—the student teacher’s teaching per- 
formance. An “incident” in the supervision of student teaching becomes a critical inci- 
dent if the student teacher identifies it as having had such impact. The instrument 
used for collecting these incidents (see Appendix) does not unduly restrict the kind of 
incident which might be reported; accordingly, it was expected that student teachers 
would be able to document incidents representing the full range of Broudy’s categor- 
ies if, in fact, these incidents were perceived as occurring in a notable way in their 
supervision. 

Broudy’s categories, which lie in the public domain, were used instead of more idi- 
osyncratic categories drawn on the basis of the incidents themselves, as is the common 
practice in the Flanagan critical incidents method. They have been accepted over 
time as constituting a rationalized reconstruction of professional activity in educa- 
tion. Put another way, the categories represent the accepted spectrum of intended 
activities comprising the education profession. Consequently, classifying the 
incidents according to these categories is a check on the congruency of intents and re- 
ality, something of a cross-check on the comprehensiveness of the student teaching 
experience as mirrored in critical incidents selected from it. 

Student teachers enrolled in an extended practicum, an internship experience, 
were asked to take part in this study. Before consenting, they were involved in a workshop
session in which the purpose and procedures of the research and the concept of a
critical incident in student teaching supervision were explained, and practice in re- 
cording a few dramatized incidents was provided. Seventeen student teachers in a 
newly established elementary internship program and 10 student teachers in a sec- 
ondary internship, in its fifth year of operation, became regular participants in the 
project. On a weekly basis they contributed reports of critical incidents in the 
supervision of their student teaching by their sponsor teachers and faculty supervi- 
sors. 

The elementary interns, with a median age of 21 and ages ranging from 20 to 33,
were slightly younger than the secondary ones, whose median age was 23 and whose 
ages ranged from 21 to 45. The elementary interns had more uniform previous educa- 
tional experiences—generally two previous practica—whereas those in the secondary 
pattern had had from 0 to 3 previous blocks of school experience. All the elementary 
interns, and all but one of the secondary interns, were in the fourth year of a four-year 
education degree program. The one secondary intern participating in the project who 
was a transfer to the Faculty of Education after a previous degree was, therefore, the 
only participant with no school experience. 

The elementary and secondary internship programs are alike in terms of their lo- 
cation, the monthly stipend provided student teachers, and the interview process by 
means of which participants are selected by school principals. In both programs 
student teachers are supervised by their sponsor teachers and by faculty members 
who are also their course instructors. 

The elementary and secondary internship programs differ somewhat. Whereas 
secondary interns are located in schools from September to April, teaching half-day 
and taking courses the other half, elementary interns teach over the whole day with 
the exception of Fridays, when they pursue coursework on campus. In addition, they 
return to campus for coursework for a month before Christmas. Secondary interns 
work continuously with the same classes, generally in two subject areas; elementary 
interns work before Christmas with one sponsor teacher and class and after Christmas 
with a second sponsor teacher and class, generally at a different grade level. 

In order that some probable antecedents of the reported critical incidents might 
be explored, each student teacher was asked to complete a brief information sheet 
providing demographic data and a description of his or her interpretation of the roles 
of the sponsor teacher and the faculty supervisor in the extended practicum. 

The elementary and the secondary critical incidents in the supervision of student 
teaching were systematically collected on a weekly basis and edited. They were classified
independently and with no disagreement by two persons according to the amended
Broudy categories, which were validated in the earlier study
(Housego & Boldt, 1985). The total number of incidents, both elementary and secondary,
in each category was determined, and the percentages of incidents in each of the
six categories compared with those found in the distributed practicum setting of the 
first study. Differences were tested for significance. 
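
The article does not name the test used to compare the percentages. As an illustration only, a pooled two-proportion z-test is one common choice for this kind of comparison. The Python sketch below applies it to the organization-management counts reported in Tables 3 and 4 (157 of 403 extended practicum incidents; 61 of 125 distributed practicum incidents). The function name and the choice of test are assumptions made for this sketch, not the authors' documented procedure.

```python
import math


def two_proportion_z(x1, n1, x2, n2):
    """Pooled two-proportion z-test; returns (z, two-sided p-value).

    Hypothetical helper: the source does not state which test was used.
    """
    p1, p2 = x1 / n1, x2 / n2
    # Pooled proportion under the null hypothesis of equal proportions
    pooled = (x1 + x2) / (n1 + n2)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    z = (p1 - p2) / se
    # Two-sided p-value from the standard normal distribution
    p_two_sided = math.erfc(abs(z) / math.sqrt(2))
    return z, p_two_sided


# Organization-management incidents: extended (Table 3) vs distributed (Table 4)
z, p = two_proportion_z(157, 403, 61, 125)
print(f"z = {z:.2f}, two-sided p = {p:.3f}")
```

A one-sided version of the same test, or a chi-square test on the full table of categories, would be equally defensible choices and may lead to a different verdict close to the .05 threshold.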


Report and Discussion of Findings

During their internship year, the 17 elementary students reported 224 and the 10 sec- 
ondary student teachers 179 critical incidents in the supervision of student teaching. 
Tables 1, 2, and 3 present, respectively, the distribution over the six categories of the 
elementary incidents, the secondary incidents, and the total number of critical 
incidents collected in the extended practicum. Table 4 (Housego & Boldt, 1985,
p. 118) presents the distribution of incidents over the same categories in a distributed
practicum.

The tables indicate that the elementary and secondary interns separately, as well
as in a total group, like the students of the previous study (Housego & Boldt, 1985), 
did not report incidents in their supervision experience having to do with aims and 
objectives, curriculum or professional content at the level of frequency which might 
be expected in the internship phase of a professional, integrated program. As in the 
previous study, student teachers reported the greatest numbers of critical incidents in 
the areas of organization and management, the basic elements of teaching-learning 
strategies, and interpersonal relationships with their supervisors. 

The impression that emerges from a close examination of both the content of the 
incidents and the tables is that, in both the extended and the distributed practica, 
much of the substantive content of previous professional training was not considered 
noteworthy enough by any of the three parties involved—the student teacher, the 
sponsor teacher, or the university supervisor—to be made the focus of supervisory in- 
teraction frequently and thereafter reported in critical incidents. This finding is 
consistent with Tabachnick and Zeichner’s (1984) description of the instrumental 
perspective, which has emerged in various studies of the impact of student teaching 
experience on the development of teaching perspectives by student teachers—“what 
works in the short run to get the class through the required lesson in a quiet orderly 
manner becomes the major criterion for evaluating a teaching activity” (p. 30). 

The difficulty of linking university coursework and practicum experiences and the 
likely predominance of the instrumental perspective seem apparent as one analyzes 
the content of the collected critical incidents. Presumably, if university classroom 
work focuses on problems of practice in the education profession, these problems 
should be highlighted during actual practice in the classroom, particularly over an ex- 
tended period of time. The tables, however, reveal an uneven distribution of reported 
critical incidents over the problems of practice, and leave one to speculate about the 
reasons for the seeming absence of attention to some very significant issues. 

Part of the problem may be rooted in the dynamics of the interaction between 
sponsor teacher and student teacher. According to Zimpher, de Voss, and Nott (1980, 
p. 13), neither sponsor teachers nor student teachers place high priority on observa- 
tion time for student teachers. Instead, in less than a week students become involved 
in full-time teaching. Moreover, interaction between sponsor teachers and their 
student teachers reveals a conscious avoidance of conflict and a tendency not to en- 
gage in substantive discussion (Olson, 1982; Tabachnick, Popkewitz & Zeichner, 
1979-1980). This is not surprising if, as Book, Byers, and Freeman (1983) suggest, 
preservice teachers feel no strong need to acquire a knowledge base in pedagogy in 
order to become effective teachers. They are confident in their own abilities and pre- 
fer to try to show immediately that they can do the job, a point of view in keeping with 
the previously mentioned instrumental or “how to” perspective concerning student 
teaching. 

In an attempt to account for the state of affairs represented in the tables, the
questionnaire data collected were carefully examined for possible antecedents. No 
clear relationships could be established, although one factor seems to stand out as 
suggestive. Generally, both elementary and secondary student teachers failed to rec- 
ognize a strong training role for their sponsor teachers or university supervisors. Uni- 
versity supervisors were seen as resource persons, evaluators, liaison persons, and 
sources of support, but not as persons whose responsibility it was to ensure that class- 
room practice was carried on at a high professional level. Surprisingly, none of the ele- 
mentary and only one of the secondary student teachers reported that either the 
sponsor teacher’s or the university supervisor’s role included providing criticism of a
student teacher’s teaching performance. In addition, the expectations student
teachers held for university supervisors did not include their integrating the learning
acquired in the university classroom with school experience. It appears that student
teachers, generally, may not expect either sponsor teachers or university supervisors
to provide critical analyses of their teaching performances, a situation which may be
comfortable for some sponsor teachers who may want to avoid this task, and somewhat
uncomfortable for many university supervisors who usually expect to perform
this function (McIntyre, 1984). As Kagan (1987) reports, student teachers prefer
“facilitative” supervision.


TABLE 1

Categorization of Critical Incidents (Elementary)

                                            Type of Incident
                                  Positive      Negative        Total
Category                             %     f       %     f       %     f
I. Aims and Objectives             1.2     2     0.0     0     0.9     2
II. Organization-Management       44.1    75    15.1     9    37.5    84
III. Curriculum                    0.0     0     0.0     0     0.0     0
IV. Teaching-Learning             20.0    34     1.9     1    15.6    35
V. Professional Content            1.2     2     0.0     0     0.9     2
VI. Student teacher-Supervisor    30.0    57    83.0    44    45.1   101
    Interpersonal Relationships
Total                            100.0   170   100.0    54   100.0   224


TABLE 2

Categorization of Critical Incidents (Secondary)

                                            Type of Incident
                                  Positive      Negative        Total
Category                             %     f       %     f       %     f
I. Aims and Objectives             0.7     1     4.7     2     1.7     3
II. Organization-Management       47.5    65    18.6     8    40.6    73
III. Curriculum                    1.5     2     0.0     0     1.1     2
IV. Teaching-Learning             24.1    33    11.6     5    21.1    38
V. Professional Content            3.6     4     2.3     1     2.8     5
VI. Student teacher-Supervisor    22.6    31    62.8    27    32.2    58
    Interpersonal Relationships
Total                            100.0   136   100.0    43   100.0   179

A comparison of the percentages of incidents in the six categories reported by the
subjects of the distributed practicum study with those reported by the subjects
involved in an extended practicum (see Figure 1) reveals only one significant difference.
As expected, student teachers in the extended practicum, contrasted with
those in the distributed practicum, reported a significantly lower percentage of critical
incidents in the organization-management category. Even so, this category still drew
39% of the reported critical incidents in the supervision of student teaching. The
substantial number of management and organization incidents in both practicum
patterns seems understandable in light of other evidence of the importance of
management and control. Tardif (1985) identifies control as an integral part of the
student teacher’s “teacher identity.” Staab (1984) documents student teachers’
insecurity with respect to classroom management. She found that they rated themselves
lower on this aspect of their performances than on personal qualities, preparation for
instruction, or the use of teaching skills. In addition, Applegate and Lasley (1985), in a
factor analytic study of the problems facing early field experience elementary student
teachers, identified management as a major concern—the first of seven factors.
Further afield, class management was found to be one of the three most pressing areas of
professional competence to be studied in the British Teacher Education Project
begun in 1976 (Wragg, 1985). Wragg asserts that, “Inability to control a class is the
single most common reason for failure of new teachers and it is frequently cited as a
major worry by student teachers” (p. 202). It appears that organization and management
are high priority concerns for Canadian, American, and British teachers and
student teachers.


TABLE 3

Categorization of Critical Incidents
(Combined Elementary and Secondary)

                                            Type of Incident
                                  Positive      Negative        Total
Category                             %     f       %     f       %     f
I. Aims and Objectives             1.0     3     2.1     2     1.2     5
II. Organization-Management       45.7   140    17.5    17    39.0   157
III. Curriculum                    0.6     2     0.0     0     0.5     2
IV. Teaching-Learning             21.9    67     6.2     6    18.1    73
V. Professional Content            2.0     6     1.0     1     1.7     7
VI. Student teacher-Supervisor    28.8    88    73.2    71    39.5   159
    Interpersonal Relationships
Total                            100.0   306   100.0    97   100.0   403

Contrasting Tables 1 and 2, one notes two tendencies. First, within the extended 
practicum, secondary student teachers reported proportionately more critical 
incidents in the organization-management category than did elementary student 
teachers. This may have occurred because older pupils present a greater management 
challenge for student teachers or perhaps because the pressures to establish formal 
control may be greater in secondary schools. Still, neither elementary student 
teachers nor secondary student teachers in the extended practicum contributed or- 
ganization-management incidents to the degree they were found in the distributed 
practicum study in which nearly half of the total number of incidents was in this cate- 
gory. This could be interpreted as a sign that student teachers in the extended 
practicum may begin to give less attention to the “survival” concerns so evident in
early practice. Secondly, secondary extended practicum student teachers also
submitted more critical incidents in the teaching-learning category than their elementary
counterparts, perhaps reflecting a more intense subject matter emphasis in
secondary schools.


Figure 1: Comparison of the Extended Practicum and Distributed Practicum
Distributions of Critical Incidents over the Amended Broudy Categories
(Vertical axis: percent of total incidents. An asterisk marks a category in which the
percentages of incidents differ significantly at the α = .05 level.)


TABLE 4

Categorization of Critical Incidents in Distributed Practicum Study

                                            Type of Incident
Category                          Positive      Negative        Total
   Subcategory                       %     f       %     f       %     f
I. Aims and Objectives             2.0     2     0.0     0     1.6     2
II. Organization-Management       57.4    58    12.5     3    48.8    61
III. Curriculum                    0.0     0     0.0     0     0.0     0
IV. Teaching-Learning             19.8    20     0.0     0    16.0    20
V. Professional Content
   1. Repertory Content            1.0     1     0.0     1     0.8     1
   2. Pedagogical Content          2.0     2     0.0     0     1.6     2
VI. Student teacher-Supervisor    17.8    18    87.5    14    31.2    30
    Interpersonal Relationships
Total                            100.0   101   100.0    25   100.0   125

Interpersonal relationships between student teachers and their sponsor teachers 
or their university supervisors are very important in the student teachers’ responses 
to their school experiences. Within the extended practicum sample, secondary
student teachers reported a lower percentage of interpersonal incidents, 32.2%, as 
opposed to the 45.1% reported by elementary subjects. In both groups the interper- 
sonal incidents are slightly less than half negative, 43.6% and 46.6% respectively. Al- 
though elementary student teachers reported more critical incidents in the 
interpersonal category than secondary students did, one cannot conclude their inter- 
personal relationships with supervisors were more troubled. Elementary student 
teachers as a group simply appeared somewhat more sensitive to the interpersonal di- 
mension of their experiences. This may have been a consequence of the greater child- 
or person-focused nature of their programs as contrasted to the stronger subject matter
emphasis of secondary programs. Alternatively, one might attribute it to the predominance
of female subjects in the group, subjects who would therefore have been
socialized to be more sensitive to the interpersonal dimension of their experiences 
(Maccoby & Jacklin, 1974; O’Neil, 1985).
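
The two percentages of negative interpersonal incidents quoted in this paragraph can be checked against the category VI rows of Tables 1 and 2: 44 of 101 elementary incidents, and 58 total less 31 positive among the secondary incidents. A short Python check of that arithmetic follows; the helper name is illustrative only.

```python
def percent_negative(negative, total):
    """Share of a category's incidents classified as negative, to one decimal."""
    return round(100 * negative / total, 1)


# Category VI (interpersonal relationships)
elementary = percent_negative(44, 101)       # Table 1: negative f, total f
secondary = percent_negative(58 - 31, 58)    # Table 2: total f minus positive f
print(elementary, secondary)                 # 43.6 46.6
```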

Continuing to focus on interpersonal critical incidents and contrasting Tables 3 
and 4, one notes that there were more interpersonal incidents submitted by the 
subjects of the extended practicum than by the subjects of the distributed practicum 
study. The incidents, however, were not so likely to be negative, perhaps because the 
time available for extended practicum student teachers and their supervisors to work 
out relationship problems and to adjust comfortably to mutual role expectations was 
longer. Also, negative critical incidents may have occurred less frequently in a situa- 
tion in which there was some preparation of sponsor teachers and some serious effort 
to match student teachers with sponsor teachers as well as an interviewing process
which helped to clarify expectations.


Implications for Teacher Education

The findings of both the present study and its earlier counterpart (Housego &
Boldt, 1985) highlight the need to sensitize supervisors of student teaching, both
sponsor teachers and faculty supervisors, to the importance of the full spectrum of the 
problems of practice which comprise the education profession, in order that they be 
regularly addressed in the supervision process. A selection of the incidents collected in 
these studies has been used in a workshop experience designed to do this. It also 
allows supervisors to share the student teacher’s perspective on the supervision expe- 
rience. Understanding which interventions have been perceived by student teachers 
as helpful or not helpful may increase supervisors’ sensitivity and allow them to be 
more effective—to make more positively perceived and fewer negatively perceived in- 
terventions, thereby contributing to the improvement of the student teacher’s class- 
room performance. Such sensitivity might also reduce the incidence of interpersonal 
problems in the supervisory relationship. Indeed, increasing awareness of student 
teachers’ perceptions of their supervision experiences could be an important activity 
within a preparatory program for sponsor teachers, the sort of program which might 
produce the desirable results described by Copeland (1977). Copeland found that 
student teachers who were placed in the classes of sponsor teachers who were pre- 
pared for their role were more likely to take the risks required to test and acquire 
laboratory-learned teaching skills, regardless of whether the sponsor teachers demon- 
strated the skills. Student teachers who saw the skills demonstrated by unprepared 
sponsor teachers were less likely to acquire the skills, and those who had sponsors who 
were neither prepared nor likely to demonstrate the skills, of course, had little chance 
of acquiring such skills. Student teachers may benefit through submitting critical 
incidents in the supervision of their teaching, a process which could lead them to be 
more reflective about teaching, more likely to think about the consequences of their 
behavior and ways of achieving alternative outcomes.

Further, the findings of this study highlight a need in the overall professional 
training program for clarifying the university supervisor’s role: bringing classroom 
practice and professional knowledge together, culminating in a high level of profes- 
sional classroom practice (Griffin, 1982). 

It is interesting to speculate how this could be accomplished. To begin with, it is 
recognized that many learnings about teaching are tacit and not subject to explicit re- 
call on demand by the student teacher. The task of the supervisor, in part, is to illumi- 
nate these learnings through focused discussion, in order that they be more 
deliberately applied in the classroom. Although tacit learnings can be functional with- 
out this unveiling, if the supervisor ensures that they are tested in the classroom, it is 
anticipated that the student teacher would be encouraged to become more reflective 
about his or her teaching. Part of the university supervisor’s task is to transform 
research-based knowledge of effective teaching into a form suitable for interpretive 
use; that is, to link research findings to the substantive problems of classroom teach- 
ing. Through a deliberate effort to concretize abstractions from theory and through 
the technology of lab-like exercises designed to test such ideas in the classroom, a 
powerful addition to the repertoire of craft rules usually acquired through practice 
and supervision could be made. Instead, the evidence of the present study, like its
earlier counterpart (Housego & Boldt, 1985), seems to point to a preconception and 
enactment of student teaching supervision which appears not to recognize as fully as 
expected the intellectual maneuverability that knowledge used interpretively can 
provide the student teacher. 


Limitations

Student teachers’ perceptions of their supervision by faculty supervisors and sponsor 
teachers comprise but one important view of supervisory interaction. Although other 
perspectives might have been useful, within the design of this study there was no plan 
to obtain information from sponsor teachers or faculty supervisors directly. Instead, 
student teaching office personnel provided sponsor teachers and faculty supervisors 
with information concerning the study and arranged “clearance.” In order to secure 
their full cooperation, student teachers were guaranteed anonymity and were also 
assured that their grades were not a subject of study and would in no way be affected 
by their participation in the project. These conditions precluded investigating rela- 
tionships between the nature of the incidents submitted and student teaching grades 
or attempting to relate sponsor teacher or faculty advisor characteristics and behav- 
iors to the critical incidents submitted by their supervisees. 


Conclusion 

Studying student teaching supervision using critical incidents methodology is one ap- 
proach to understanding something further of the pattern of student teacher sociali- 
zation, particularly as it unfolds in supervisory interaction. With a fuller 
understanding of how student teachers respond to supervisory behaviors, it may be- 
come increasingly possible to help them progress beyond the survival concerns and 
the “how to” perspective which characterize early practice. 
Appendix 
Code Number: 
Response Sheet 


Critical Incidents in the Supervision 
of Student Teaching in an Extended Practicum 


Date recorded: 


1. What led up to this incident? (the context of this incident) 


2. Which supervisor was involved? 
   [ ] faculty supervisor   [ ] sponsor teacher 


3. What behavior(s) on the part of your sponsor teacher or your faculty supervisor do you think 
   led to improvement in your teaching or, on the other hand, delayed or impeded your 
   progress? (the incident itself) 


4. Classify the above incident by checking ONE of the boxes 

   [ ] positive incident   [ ] negative incident 
The purpose of this study was to investigate the degree of burnout among secondary 
school teacher coaches and to identify factors which contributed to this burnout. The 
subjects for the study were 75 secondary school teacher coaches from three school divi- 
sions in Alberta. The subjects responded to an adapted version of Maslach’s Burnout 
Inventory and their subscale scores were then used to place them in one of 
Golembiewski’s eight phases of burnout. In addition, qualitative data on the causes of 
burnout were obtained through interviews with selected subjects. The findings of the 
study show that Golembiewski’s phase model was useful for identifying factors relating 
to higher levels of burnout in teacher coaches. In this study the gender of the teacher 
coaches, their age, the size of the school they worked in, the amount of school adminis- 
tration support for coaching, and the compensation, recognition, and rewards for 
coaching were all shown to be related to higher levels of burnout. 


In recent years considerable interest has developed in a phenomenon which is referred 
to as burnout. It has been called inter alia the “crisis of the 80s” and the “disease of 
modern life” (Maslach, 1983, p. 29). Burnout has been seen as a reaction to and a con- 
sequence of chronic occupational stress within certain professions which involve what 
is termed “people related work” (Maslach, 1978). Maslach, who has done pioneering 
research in the area of burnout, has examined the phenomenon in a number of 
people-related professions and has determined that it comprises three dimensions: 
emotional exhaustion, depersonalization, and personal accomplishment. In 


Professor Quigley is with the School of Physical Education and Athletics and is particularly interested in the study of coaching. 


Dr. Slack is a member of the Department of Physical Education and Sport Studies whose research interest is in the area of organizational change. 


Dr. Smith is also with the Department of Physical Education and Sport Studies and is currently examining the influence of gam- 
bling and mass media in sport. 


Burnout in Secondary School Teacher Coaches 


describing the phenomenon, Maslach (1982) states: 

Burnout is a syndrome of emotional exhaustion, depersonalization and reduced personal 
accomplishment that can occur among individuals who do people work of some kind. It is a 
response to the chronic emotional strain of dealing extensively with other human beings, 
particularly when they are troubled or having problems. Although it has some of the same 
deleterious effects as other stress responses, what is unique about burnout is that the stress 
arises from the social interaction between helper and recipient. (p. 3) 

It seems reasonable to suggest that problems associated with this type of social in- 
teraction, between helper and recipient, may also be found in the interaction of those 
individuals who teach and coach in our schools. However, a search of the literature 
shows that although there have been a number of prescriptive articles (cf. Austin, 
1981; Kohlmaier, 1981; Malone & Rotella, 1981), there have only been two studies 
(Haggerty, 1982; Wilson & Chambers, 1983) that have examined burnout as experi- 
enced by coaches. Haggerty (1982) investigated the degree of burnout experienced by 
Canadian university coaches. However, his study was limited to coaching rather than 
the entire work-related stresses of university coaches. His findings show that the 
higher the coaches’ measured levels of burnout, the more these coaches had a negative 
reaction to the coaching environment and the more they reported adverse physiologi- 
cal and psychological reactions to coaching. He suggested that coaching success as 
measured by win/loss record did not have an effect on burnout levels. Coaches who 
perceived their jobs to be stressful reported significantly more feelings of emotional 
exhaustion and depersonalization. Haggerty also reported that gender was the only 
biographic characteristic related to burnout, with female coaches reporting signifi- 
cantly less personal accomplishment than male coaches. Those individuals who 
coached female athletes also reported less intense feelings of personal accomplish- 
ment, probably due to the generally low status given to female athletic programs in 
Canada. The Wilson and Chambers (1983) study was also conducted on Canadian 
university coaches and showed that “approximately 20% ... reported serious burnout 
symptoms which would affect their job performance and the quality of their daily 
lives” (p. 1). 

These two studies show, then, that coaches often do experience symptoms of burn- 
out. However, as Haggerty (1982) pointed out: 

the coach’s duties in coaching contributed to burnout, but further investigation must 
consider the effects of an individual’s total work duties and responsibilities on the development 
of burnout. (p. 87) 

One area where the effects of coaching as a part of an individual’s total work and 
other related factors are likely to lead to burnout is in the country’s high schools. Re- 
cent budget and staff cutbacks have made it very difficult for high school physical 
education teachers and coaches to meet the demands of operating a high quality ath- 
letic program. As resources within their departments are reduced, there is pressure on 
schools to condense existing programs. This conflict between supply and demand has 
placed a great deal of stress on the physical education and athletic staff in our second- 
ary schools. The Canadian school population peaked in 1977 at 1.7 million and has de- 
clined since that time and is predicted to decline on into the late 1980s when a 
population of 1.3 million will be reached (Macintosh, 1979, p. 31). Declining student 
populations are responsible for the staff cutbacks, closure of schools, and the surplus 
of teachers which have subsequently resulted in the lack of job opportunities for new 
physical education graduates. The reduced flow of recent graduates into secondary 
schools has affected physical education and athletic programs. As Macintosh points 
out: 
The influx of the recent graduate has been the single most important vehicle for the 
transmission of new knowledge and technique in sport and physical activity from the universities 
to the schools. In addition, the enthusiasm and vigour of these young persons, as is the case in 
all professions has been the source of inspiration and revitalization for physical educators 
already in the system. (p. 31) 

The present situation has serious implications for the athletic programs in second- 
ary schools. Today, physical education department heads are finding it more and 
more difficult to staff their existing interschool sport programs (Macintosh, 1979). In 
many situations a handful of coaches are attempting to run the entire interschool ath- 
letic program. “Too few” are attempting to do “too much.” This conflict has been iden- 
tified as being a very real threat to interschool athletic programs and should be the 
concern of all physical educators and administrators. Macintosh expressed his con- 
cern for interschool athletics as a result of the present staffing problems: 

A large number of the interschool coaching force has traditionally been those young physical 
educators and other teachers recently recruited into the teaching field. After a number of 
years of coaching, teachers have tended to drop out of coaching, opting for less onerous 
extracurricular responsibilities. In addition the struggle between teacher federations, which are 
advocating a reduction in the teacher-student ratio as an alternative to redundancies in the 
face of falling enrollment, and school boards which face citizen pressure to reduce school 
costs consistent with falling enrollment, will probably result in a lower teacher morale and 
greater reluctance on the part of teachers to assume coaching roles. During the next decade 
when the influx of young new teachers into most secondary schools will be minimal, critical 
shortages of the interschool coaches are likely to develop. (p. 31) 

It is, then, a critical period for high school athletics. Retention of the existing high 
school coaches is essential in order for the educational component as well as the conti- 
nuity and quality of our interschool sport programs to be maintained. Thus the issue 
of burnout becomes of greater importance when considering the future operation of 
the high school programs. Efforts to maintain the morale and enthusiasm of the existing 
coaches are a key issue in the future of interschool athletics. The present teaching-coaching 
environment may in fact be contributing to the susceptibility to burnout in 
the teacher-coach. However, before any recommendations can be made, the present 
situation must be investigated. To what degree does burnout exist among secondary 
school teacher coaches? What are the contributing factors leading to burnout? It is 
only through the identification of these factors that better incentives can be provided 
for existing school based personnel. Consequently the purpose of the study reported 
here was to investigate the degree of burnout among secondary school teacher coaches 
and to identify factors which may contribute to burnout. More specifically the pur- 
pose of the study was: 


1. To determine the extent to which burnout was experienced by a sample of teacher 
coaches and to compare these findings to Maslach’s scores for her three burnout 
subscales and to the scores used by Golembiewski to establish his phase model of 
burnout. 


2. To categorize the sample of teacher coaches into the eight phases of burnout de- 
veloped by Golembiewski. 

3. To examine how selected biographic factors influenced the teacher coaches’ phase 
of burnout. 


4. To examine the influence of selected school-related factors on the teacher coaches’ 
phase of burnout. 


Subjects 

The subjects for the study were secondary school teacher coaches from three school 
divisions operating in the province of Alberta. The teacher coaches had to have 
coached at least one major sport at the senior high school level for two of the past four 
years. In addition, one of these years of coaching must have been in the last two years. 
The teacher coaches also had to be full-time employees of their particular school divi- 
sion. Ninety-two teacher coaches within the three school systems selected met these 
requirements and consequently were included in the study. However, 17 of them were 
unable or unwilling to participate in the study. Consequently, the final study sample 
consisted of 75 teacher coaches. 


Research Instruments 
The research was conducted using three sequential and interrelated components: the 
Adapted Maslach Burnout Inventory, Golembiewski’s Phase Model of Burnout, and a 
semi-structured interview. 


Adapted Maslach Burnout Inventory 

The Maslach Burn-Out Inventory (MBI) is the most widely used scale in researching 
burnout (Paine, 1983, p. 15). The MBI is a questionnaire containing 25 Likert-type 
items which measure the three dimensions of burnout mentioned earlier: emotional 
exhaustion, depersonalization, and personal accomplishment. The items in the emo- 
tional exhaustion subscale describe feelings of being emotionally extended and ex- 
hausted by one’s work. The items in the depersonalization subscale describe “an 
unfeeling and impersonal response toward recipients of one’s care or service” 
(Maslach & Jackson, 1981a, p. 101). For both the emotional exhaustion and depersonalization 
subscales, higher mean scores correspond to higher degrees of experienced 
burnout. The subscale of personal accomplishment measures feelings of 
competence and successful achievement in one’s work with people. In contrast with 
the other two subscales, lower mean scores on Maslach’s personal accomplishment 
subscale correspond to higher degrees of burnout. Maslach used two dimensions, fre- 
quency and intensity, to describe each statement in the MBI. Golembiewski (personal 
communication, May 1984) stated that these dimensions have been found to have a 
high correlation and that respondents were “somewhat confused” by the use of both of 
them. Similarly, Haggerty (1982), who adapted the MBI to the coaching environment, 
reported that there was a moderate to high relationship between how frequently 
coaches experienced various feelings and how intensely these feelings were felt. 
Haggerty also reported that slight rewording of the MBI for the coaching population 
did not affect the validity or the reliability of the instrument. Thus the wording in 
Haggerty’s adapted version of the MBI was used for this study. Following the recom- 
mendation of Golembiewski, Munzenrider, and Carter (1983), respondents were not 
asked to rate each item twice, once for frequency and once for intensity. Rather they 
were asked to respond to each item by marking, on a seven-point Likert scale, the ex- 
tent to which each item in the adapted MBI was like them or unlike them. As 
Golembiewski, Munzenrider, and Carter (p. 464) suggest, this type of revisionism does 
not “distort Maslach’s intent or results.” The adapted MBI was administered along 
with a short questionnaire that solicited biographic and demographic data about the 
teacher coaches and also information about their coaching situation, for example, 
level coached, gender coached, and so forth. 
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Golembiewski’s Phase Model of Burnout 

Golembiewski (1983a) has used the MBI to develop a phase model approach to burn- 
out. In this model, burnout is conceptualized as a process rather than as a single event 
and eight phases of burnout are outlined. Golembiewski reported that “as burnout 
phases progress, numerous indicators of the quality of work life deteriorate” (p. 11). 
Golembiewski suggested that through studying burnout from a phase model, different 
causes and precursors of burnout may be identified. 

Golembiewski’s model assigns different priorities or prepotencies to Maslach’s 
three subscales. Depersonalization is seen as the least potent contributor to burnout 
followed by personal accomplishment and finally emotional exhaustion. The three 
MBI subscales are dichotomized as either high or low using the group median as the 
cutoff. Based on the prepotency of the three subscales, an eight phase model of burn- 
out was developed (see Table 1). As suggested in Golembiewski’s phase model ap- 
proach, the respondents in this study were categorized according to their phase of 
burnout. 
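
The categorization described above can be illustrated with a short sketch. It is not the procedure actually run in this study; the function names, the treatment of scores that fall exactly on the median, and the sample scores are assumptions introduced only for illustration. The sketch also assumes that Personal Accomplishment has already been reverse-scored, so that "Hi" means more burnout on every subscale.

```python
from statistics import median


def dichotomize(scores):
    """Label each score 'Hi' or 'Lo' relative to the group median.

    Assumption: only scores strictly above the median count as 'Hi'.
    The article does not say how ties at the median were handled.
    """
    cutoff = median(scores)
    return ["Hi" if s > cutoff else "Lo" for s in scores]


def burnout_phase(dp, pa, ee):
    """Map three 'Hi'/'Lo' labels to a phase number from 1 to 8.

    The weights follow the stated prepotency: Depersonalization (1) is
    the least potent, Personal Accomplishment (2) comes next, and
    Emotional Exhaustion (4) is the most potent.
    """
    return 1 + (dp == "Hi") * 1 + (pa == "Hi") * 2 + (ee == "Hi") * 4


# Illustrative scores only; these are not data from the study.
dp_scores = [8, 12, 15, 9, 20]
pa_scores = [18, 25, 30, 16, 22]   # assumed to be reverse-scored already
ee_scores = [20, 35, 28, 18, 40]

labels = zip(dichotomize(dp_scores), dichotomize(pa_scores), dichotomize(ee_scores))
phases = [burnout_phase(dp, pa, ee) for dp, pa, ee in labels]
print(phases)
```

Weighting the three labels by 1, 2, and 4 reproduces the ordering of the eight phases: a respondent who is low on all three subscales falls in phase 1, and one who is high on all three falls in phase 8.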

It is very important to note that to establish his phase model of burnout 
Golembiewski, unlike Maslach, reverses the scoring system for the items comprising 
the Personal Accomplishment subscale. Thus in Golembiewski’s model, higher scores 
on Personal Accomplishment indicate higher levels of burnout. Golembiewski adopts 
this variant in order to bring consistency to the three subscales, that is, for each 
subscale, a higher score means a higher level of burnout. This format also allows 
Golembiewski to total the scores on the three subscales, and thus as the total score in- 
creases, the phase of burnout increases. 
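
The reversal described above can be sketched as follows. The coding of the seven-point scale as 1 to 7, the function names, and the sample responses are assumptions made for illustration; the sketch is not claimed to reproduce the subscale values reported in this study.

```python
def reverse_item(response, points=7):
    """Reverse a Likert response, assuming it is coded 1..points."""
    if not 1 <= response <= points:
        raise ValueError("response outside the assumed 1..points range")
    return points + 1 - response


def subscale_score(responses, reverse=False, points=7):
    """Sum the item responses of one subscale, reversing them if asked."""
    items = [reverse_item(r, points) if reverse else r for r in responses]
    return sum(items)


def total_burnout_score(dp_items, pa_items, ee_items):
    """Total the three subscales so that a higher total means more burnout.

    Only Personal Accomplishment is reversed; the other two subscales
    already run in the burnout direction.
    """
    return (
        subscale_score(dp_items)
        + subscale_score(pa_items, reverse=True)
        + subscale_score(ee_items)
    )


# Hypothetical responses from one respondent (not study data).
print(total_burnout_score([2, 3], [7, 6], [5, 4]))
```

With every subscale running in the same direction, the subscale scores can be summed without one of them cancelling the others.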


Semi-Structured Interview 

The third component of the research instrument was a semi-structured interview. The 
interview schedule was constructed in order to elicit data about areas which the litera- 
ture has indicated may cause burnout. The first area dealt with information about the 
respondents, specifically their reasons for coaching and their athletic and coaching 
background. The second area was concerned with school-related factors contributing 
to burnout. Here, data were elicited on school size, the support school administrators 
gave to the interschool athletic program, compensation for coaching, and the amount 
of recognition given to coaching. 


Method 

The 75 teacher coaches were asked to respond to the Adapted MBI and fill out the 
brief questionnaire which requested biographic and demographic data and informa- 
tion about their coaching situation. In order to determine the degree of burnout in the 
teacher coaches, their responses to the Adapted MBI were coded onto optical scan- 
ning sheets and, using the SPSSx program, scores were generated for each of the 
Adapted MBI subscales. In order to determine which phase of burnout the teacher 
coaches fell into, their subscale scores were dichotomized and, based on these results, 
respondents were placed in one of Golembiewski’s eight phases of burnout. 

Twenty-one of the 75 teacher coaches who participated in the research were then 
selected to take part in the interview component of the study. The purpose of this 
component was to elicit data about possible determinants of burnout. The selection of 
interviewees was determined by their phase of burnout. Seven individuals were select- 
ed from the most extreme phase of burnout, phase 8. Seven individuals were selected 
from phase 4, and seven from phase 1. In phases 1 and 8, the interviewees were ran- 
domly selected. However, because there were only seven respondents in phase 4, all of 
TABLE 1 

Eight Phases of Burnout 

                          I    II   III  IV   V    VI   VII  VIII 

Depersonalization         Lo   Hi   Lo   Hi   Lo   Hi   Lo   Hi 
Personal Accomplishment   Lo   Lo   Hi   Hi   Lo   Lo   Hi   Hi 
Emotional Exhaustion      Lo   Lo   Lo   Lo   Hi   Hi   Hi   Hi 

(From Golembiewski, Munzenrider, & Carter, 1983) 


them were interviewed. Golembiewski (personal communication, May, 1984) has 
suggested that these three phases be selected rather than a small sample from each 
phase of burnout. These phases are seen as providing a balanced sample in that they 
represent respectively high, moderate, and low levels of burnout. 
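
The selection rule for interviewees can be expressed as a brief sketch. The function name, the identifiers, and the optional seed are hypothetical, and the illustrative phase counts in the usage lines are not taken from the study except for the seven respondents in phase 4.

```python
import random


def select_interviewees(phase_by_id, per_phase=7, phases=(1, 4, 8), seed=None):
    """Pick interviewees from the chosen phases.

    If a phase holds no more than `per_phase` respondents, all of them
    are taken; otherwise `per_phase` respondents are drawn at random
    without replacement.
    """
    rng = random.Random(seed)
    selected = {}
    for phase in phases:
        pool = sorted(rid for rid, p in phase_by_id.items() if p == phase)
        if len(pool) <= per_phase:
            selected[phase] = pool
        else:
            selected[phase] = sorted(rng.sample(pool, per_phase))
    return selected


# Hypothetical respondent identifiers mapped to phases.
phase_by_id = {i: 1 for i in range(10)}
phase_by_id.update({i: 4 for i in range(10, 17)})   # exactly seven in phase 4
phase_by_id.update({i: 8 for i in range(17, 30)})
chosen = select_interviewees(phase_by_id, seed=0)
print({phase: len(ids) for phase, ids in chosen.items()})
```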


Results and Discussion 

Degree of Burnout in Secondary School Teacher Coaches 

Maslach and Jackson (1981a) conceptualized burnout as a continuous variable, rang- 
ing from low to moderate to high degrees of experienced burnout. It is not a dichoto- 
mous variable which is either present or absent. For Maslach, a high degree of burnout 
is reflected in high scores on the Emotional Exhaustion and Depersonalization 
subscales, and in lower scores on the Personal Accomplishment subscale. A moderate 
degree of burnout is reflected in moderate scores on all three subscales. A low degree 
of burnout is reflected in low scores on the Emotional Exhaustion and Depersonaliza- 
tion subscales, and in high scores on the Personal Accomplishment subscale. Maslach 
and Jackson (1981b) indicated that scores are considered to be high if they are in the 
upper third of the normative distribution, moderate if they are in the middle third, 
and low if they are in the lower third. Maslach established norms for the three MBI 
subscales which have led to meaningful comparisons being made across different dis- 
ciplines and populations. 
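
The thirds rule can be sketched as below. The cut points passed to the functions are placeholders rather than Maslach's published norms, and the handling of a score that falls exactly on a cut point is an assumption.

```python
def tertile_level(score, lower_cut, upper_cut):
    """Place a score in the lower, middle, or upper third of a norm group.

    `lower_cut` and `upper_cut` are the boundaries between the thirds.
    Assumption: a score equal to a cut point belongs to the lower band.
    """
    if score <= lower_cut:
        return "low"
    if score <= upper_cut:
        return "moderate"
    return "high"


def burnout_level(score, lower_cut, upper_cut, higher_is_better=False):
    """Translate a subscale score into a degree of experienced burnout.

    For Personal Accomplishment under Maslach's convention, set
    `higher_is_better=True`: a score in the upper third then indicates
    a low degree of burnout.
    """
    level = tertile_level(score, lower_cut, upper_cut)
    if higher_is_better:
        return {"low": "high", "moderate": "moderate", "high": "low"}[level]
    return level


# Placeholder cut points, chosen only to show the logic.
print(burnout_level(25, lower_cut=10, upper_cut=20))
print(burnout_level(25, lower_cut=10, upper_cut=20, higher_is_better=True))
```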

The subscale means of the teacher coaches in this study can be compared to 
Maslach’s norms for each of the MBI subscales. Thus it is possible to gain an under- 
standing of the degree of burnout exhibited by the teacher coaches. The mean score 
on the Depersonalization subscale for the teacher coaches was 11.2; this compares to 
Maslach’s mean scores of 9.4 (frequency) and 11.7 (intensity). This score places the 
teacher coaches in the middle third of Maslach’s normative distribution for this 
subscale, thus indicating a moderate level of experienced burnout. In terms of 
Personal Accomplishment, the teacher coaches had a mean score of 44.7 as compared 
to Maslach’s mean scores of 36.0 (frequency) and 39.7 (intensity). This score places 
the teacher coaches in the upper third of Maslach’s normative distribution and, because 
higher scores on this subscale correspond to lower burnout, means they are in the low 
level of experienced burnout as measured by this subscale. 
The teacher coaches’ mean score on the final subscale, Emotional Exhaustion, was 29.4 
as compared to Maslach’s scores of 24.0 (frequency) and 31.6 (intensity). This placed 
the teacher coaches in the middle third of Maslach’s normative distribution for the 
Emotional Exhaustion subscale and so indicated a moderate level of experienced 
burnout. Overall then, when compared to Maslach’s normative distributions, the 
teacher coaches in this study experienced a moderate to low range of burnout. 
When the Adapted MBI mean scores are compared with those used by 
Golembiewski (1983b) in his phase model of burnout, we find the following results. 
The mean score for Depersonalization for the teacher-coach sample of this study was 
11.2. The mean for this subscale was lower than that of Golembiewski who reported 
the mean for Depersonalization to be 18.2. The score obtained in this study would 
only fall in the 18.8 percentile of Golembiewski’s study. Thus the teacher coaches in 
this study experienced less Depersonalization (thus contributing to lower levels of 
burnout) than Golembiewski’s sample. The mean (20.8) for the subscale Personal Ac- 
complishment was also lower than that reported by Golembiewski (25.8). A score of 
20.8 would have fallen into the 25.2 percentile range of Golembiewski’s norms. Be- 
cause these figures were obtained by reversing the scoring system on those items 
which make up this subscale, a higher score means a higher level of burnout. Teacher 
coaches then experienced a greater sense of Personal Accomplishment (thus contrib- 
uting to a lower level of burnout) than Golembiewski’s sample. Finally, in terms of 
Emotional Exhaustion, the teacher coaches’ mean (29.4) was found to be higher than 
the mean reported by Golembiewski (22.8). A score of 29.4 would fall in the 71.7 per- 
centile range of Golembiewski’s norms. Thus teacher coaches experienced greater 
Emotional Exhaustion, which contributed to a higher level of burnout than 
Golembiewski’s sample. 

Overall, the results of the study, when Adapted MBI scores are compared to 
Golembiewski’s findings, indicated that the secondary school teacher coaches experienced 
less Depersonalization (thus contributing less to burnout), a greater sense of 
Personal Accomplishment (thus contributing less to burnout), and greater Emotional 
Exhaustion (thus contributing more to burnout). This would suggest that the teacher 
coaches of this study experienced less burnout than those individuals in the large federal 
(US) government agency studied by Golembiewski. However, although these levels of 
burnout are lower, they are still significant, and it is nonetheless important to offer an 
explanation for them, for as Ratsoy, Sarros, and Aidoo-Taylor (1986, p. 282) point out, 
“any form of distress in education is a matter of concern.” 

In explaining the findings, then, it seems reasonable to suggest that, due to the na- 
ture of the teaching-coaching profession, reporting feelings of Depersonalization may 
be contrary to professional ideals. The low score on Depersonalization may reflect a 
social desirability effect. In other words, it would be more socially acceptable for 
workers in a bureaucratic agency to report feelings of Depersonalization than it would 
be for teacher coaches working with young student athletes. However, it is also 
suggested that the low score on Depersonalization may in fact reflect teacher coaches’ 
feelings and association with the student athletes with whom they work. The results 
of the content analysis of the interview data of this study indicated that the student 
athletes do not contribute to feelings of burnout. Instead, they are a source of satisfac- 
tion for the teacher coaches. The following excerpts were typical comments of the 
teacher coaches when interviewed about their feelings on their student athletes: “The 
students are my salvation,” “I think they are one of the rewarding things about being 
in this profession.” “Thank God for the kids! They’re the best part of the job.” This 
finding is consistent with those in a recent article by Blase and Pajak (1986, p. 317) 
who suggest that teachers experience a positive impact on their self-esteem and confi- 
dence from being able to successfully influence students. 

It may also be suggested that the motivation of the student athletes versus the “cli- 
ents” of the various workers Maslach and Golembiewski have studied may account for 
the greater sense of Personal Accomplishment reported by these teacher coaches. The 
comments of one teacher-coach interviewed in this study reflect the potential for feel- 
ings of Personal Accomplishment. 
I basically like the socialization with the kids. I like talking to them about what they can get 
out of sport, and what they are looking for in sport, and I really like the fact that in coaching 
you can get immediate results from hard work. 

It is feasible, too, that the nature of coaching provides constant feedback to the 
coach as compared to the bureaucratic nature of the workers studied by Maslach and 
Golembiewski. This feedback goes beyond the immediate win/loss record for a given 
season. Coaches have a direct comparison of athletes’ past performances and skills by 
which they (the coaches) can measure improvement. The feedback and intrinsic grati- 
fication coaches receive undoubtedly contribute to these feelings of Personal Accom- 
plishment. As Stevenson (1984) points out: 

It seems clear that the central motivation for the extracurricular work comes overwhelmingly 
from the intrinsic reward which it provides. Teachers coach because of the satisfaction they 
derive from it. The satisfaction of achieving, of seeing the kids learn, improve and go on to 
better things. (p. 5) 

Using Maslach’s scoring convention, female teacher coaches reported lower scores 
(X̄ = 41.0) on the Personal Accomplishment subscale than their male counterparts 
(X̄ = 45.0); this difference was statistically significant at the .05 level. The finding 
corresponds to Haggerty’s (1982) study of Canadian university coaches. He reported 
(p. 81) that male and female coaches had similar degrees of burnout for most 
subscales with the exception that “females reported less intense feelings of Personal 
Accomplishment than males.” Since female coaches in this study primarily coached 
female teams, it is suggested that the lower mean score for female coaches may be ex- 
plained by the lesser amount of attention usually given in high schools to women’s 
athletic programs (cf. Hall & Richardson, 1982). 

Female coaches interviewed in this study indicated that athletics was more of a 
central focus and priority among the high school boys than it was for the high school 
girls. One female coach reported, “it doesn’t seem to mean as much to the girls. Their 
reasons for participation are different than the boys.” 

It is difficult to determine whether the emphasis girls place on high school ath- 
letics is the result of the coaching they receive or if, in fact, the reasons for participat- 
ing on school teams are different as a result of the different socialization process. The 
recognition and profile of men’s athletics are much higher than women’s athletics (cf. 
Hall & Richardson, 1982). It is suggested that the support which accompanies this 
recognition may influence female coaches’ sense of Personal Accomplishment. 

As would be expected, coaches reporting that they had experienced a high degree 
of success over the past four years had the highest mean score (using Maslach’s scoring 
convention) on Personal Accomplishment (X̄ = 48.9). The mean score for coaches 
reporting “quite successful” teams was 45.5; and coaches reporting “moderately successful” 
teams had the lowest mean score (X̄ = 39.0). Teacher coaches interviewed in 
this study indicated that their definition of success went beyond a simple win/loss 
definition. Thus a sense of accomplishment in teaching and coaching measured 
through participation of the students and the development of student athletes, as 
opposed to just the win/loss record, could explain the observed relationship between 
reported success and personal accomplishment. 

The interview data suggest that the high score on Emotional Exhaustion reflects a 
work overload situation experienced by the teacher coaches rather than the cumula- 
tive product of feelings of Depersonalization and lower sense of Personal Accomplish- 
ment (as outlined by Golembiewski’s prepotencies or priorities of the subscales). The 
work overload experienced by these teacher coaches appears to be a major contributor 
to the reported burnout. This is consistent with two recent studies: one by Ratsoy, 
Sarros, and Aidoo-Taylor (1986) which examined organizational stress among district 
based administrators, principals and assistant principals, and nonadministrative 
school-based personnel; and one by Blase and Pajak (1986) which examined the im- 
pact of work life on the personal lives of school teachers. Both studies saw work 
overload as a key contributor to stress and a major factor that had a deleterious effect 
on the personal life of teachers. The finding is also supported by a study of school ad- 
ministrators by Sarros and Friesen (1987); their research identified work load as a key 
contributor to feelings of Emotional Exhaustion burnout. 

Female coaches were particularly susceptible to high scores on the Emotional 
Exhaustion subscale (a mean of 33.6 as compared to 28.0 for male coaches); this was 
statistically significant at the .05 level. The combination of coaching with traditional 
domestic responsibilities, particularly for married females, and the shortage of 
female teacher coaches may have placed them in an even 
greater work overload situation than their male counterparts. This in turn probably 
contributed to the reported high level of Emotional Exhaustion. 

The comparison of the mean scores based on marital status supports the findings 
of Maslach. Single coaches scored higher on Emotional Exhaustion than did married 
coaches (single X̄ = 33.2; married X̄ = 28.5). It is suggested that the single teacher-coach 
may have more of a tendency to overidentify or become overly involved in his or 
her work than teacher coaches whose family commitments restrict or limit their in- 
volvement. A comment from one coach exemplified how her commitment would 
change with marriage and a family. 

Although I enjoy coaching now, my priorities will change in the future when I have a family. 
My first priority will always be my family commitments, then teaching and then coaching. If 
there is not enough time for coaching, I won’t coach. 

Teacher coaches who had taught less than 50% physical education had a lower 
mean score (26.8) on Emotional Exhaustion than those who taught 50% or more physical 
education (31.7). This could be due to the fact that coaches in this study who were 
teaching more than 50% physical education coached almost twice as many major 
sports as did coaches who taught less than 50% physical education. The latter 
group reported coaching an average of 3.7 major sports over the past four years as 
compared to 6.1 major sports for coaches who had taught more than 50% physical 
education. Excessive involvement in coaching was reported as a major source of burnout 
among teacher coaches interviewed in this study. The responsibility for staffing 
the interschool athletic program falls on the shoulders of the physical education de- 
partment. Physical education teachers reported that they were expected to coach: “It 
comes with the territory,” “It’s an expectation of the administration and it’s an expec- 
tation I hold myself.” Stevenson (1984) reported similar findings when he interviewed 
high school teacher coaches in eastern Canada. 

It is understood by everyone involved that this is what physical education teachers do. It is an
expectation which is held by their colleagues in the physical education department. You
coach because everyone coaches, and if you don’t then someone else will have to do “your”
load in addition to their own. And of course the students expect it. ... Through a process of
socialization, they have come to incorporate this extracurricular work as an expected component
of the physical education teacher’s role. (p. 3)

It may be seen, then, that the indicators of burnout among teacher coaches vary 
from other occupations. Essentially, teacher coaches experience less Depersonaliza- 
tion, greater Personal Accomplishment, and far more Emotional Exhaustion, mainly 
due to an overload situation. 

When the scores of the subscales are dichotomized and used to place respondents 
into the phases of burnout as developed by Golembiewski, we find that just over 53% 
of teacher coaches fall in the upper four phases of burnout. As a result, although the 
teacher coaches in this study report experiencing less burnout than the respondents in 
both Maslach’s and Golembiewski’s studies, there are over half who fall into the upper 
phases of experienced burnout (see Table 2). Consequently, it becomes important to 
attempt to establish what factors contribute to these feelings of burnout. 
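
The placement of respondents into phases can be illustrated with a short sketch. It is not the scoring procedure used in this study: the ordering of the eight phases below follows the usual statement of the phase model (Depersonalization least potent, reversed Personal Accomplishment next, Emotional Exhaustion most potent), and the cut-off scores are hypothetical, because the actual cut-offs are not reported here.

```python
from typing import Dict, Tuple

# Phase lookup keyed by (DP high?, reversed PA high?, EE high?).
# Assumption: conventional ordering of the eight phases of the phase model.
PHASES: Dict[Tuple[bool, bool, bool], str] = {
    (False, False, False): "I",
    (True, False, False): "II",
    (False, True, False): "III",
    (True, True, False): "IV",
    (False, False, True): "V",
    (True, False, True): "VI",
    (False, True, True): "VII",
    (True, True, True): "VIII",
}

# Hypothetical cut-offs used only for illustration; a score above the
# cut-off is treated as "high" on that subscale.
EXAMPLE_CUTOFFS = {"dp": 8, "pa": 15, "ee": 20}


def assign_phase(dp: float, pa_reversed: float, ee: float,
                 cutoffs: Dict[str, float]) -> str:
    """Dichotomize the three subscale scores and look up the phase."""
    key = (dp > cutoffs["dp"], pa_reversed > cutoffs["pa"], ee > cutoffs["ee"])
    return PHASES[key]


def in_upper_phases(phase: str) -> bool:
    """True for Phases V to VIII, the 'upper four' phases."""
    return phase in {"V", "VI", "VII", "VIII"}


# A respondent low on Depersonalization and reversed Personal Accomplishment
# but high on Emotional Exhaustion falls into Phase V under these assumptions.
example_phase = assign_phase(dp=5, pa_reversed=10, ee=33, cutoffs=EXAMPLE_CUTOFFS)
```

Because Emotional Exhaustion is treated as the most potent subscale, any respondent scored as high on it lands in one of the upper four phases regardless of the other two subscales.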


Factors Influencing the Phase of Burnout in
Secondary School Teacher Coaches

Initial studies on burnout focused on the differing psychological attributes of individ- 
uals and how these may be a cause of burnout. However, more recent work (cf. Caroll 
& White, 1983; Cherniss, 1980) has moved from this focus on individual human agents 
to a more structural approach. Consequently, rather than asking the question who is 
to blame for burnout, they are asking what is causing burnout. Maslach (1982) points 
out in reference to the who versus the what question of burnout that: 

the question points us in other directions that encourage us to substitute an analytical microscope
for our people watching binoculars. Rather than just looking at defective people, we
focus our attention on the situation in which they find themselves. (p. 9)

In this study a semistructured interview was used to collect data about a number of 
biographic and school-related factors which the literature has indicated may influence
burnout. In this section, information is presented on the influence of several of
these factors, specifically in relation to an individual’s phase of burnout. 

An analysis of the gender of respondents in terms of their phase of burnout 
showed that 63.1% of the females in this study were in the upper four phases of burn- 
out (V-VIII); this compares with 49.0% of males who were in similar phases. Female 
coaches in this study also reported coaching more sports over the previous four years 
than had their male counterparts. Female coaches had coached an average of six dif- 
ferent sports as compared to males who had coached an average of 4.6 different sports 
over the past four years. It is feasible to suggest that the higher percentage of women 
in the upper phase of burnout is a result of the overload that these women carried in 
order to maintain the women’s athletic programs in their schools. An extract from an 
interview with one female coach who was in Phase VIII of burnout exemplifies this. 

When I first started here I was the only woman on the physical education staff and there were
all sorts of teams to coach. ... I did everything. ... But I’m beginning to not enjoy it so much,
and feel that it is because I have done so much for so long.

In addition to the concern expressed by female coaches about their overload, all of 
those interviewed in this study noted that they experienced conflict between their 
family role and their coaching role. Those who did not have families indicated they 
could foresee a conflict with coaching when they started a family. This former type of 
conflict becomes more evident when the marital status of male and female coaches is 
compared to their phase of burnout. Seventy-five percent of married females were in 
burnout phases V to VIII whereas only 47.5% of males fell into these categories. As 
Pines and Aronson (1981, p. 87) have indicated, the expectations that society has 
traditionally set for women as homemakers, coupled with the additional expectations 
outside the home, have resulted in an “external situational stress and internal emo- 
tional stress, both affecting the working woman’s self-perception as a competent hu- 
man being.” With the lack of opportunity for new physical education graduates in the 
teaching-coaching field, women’s athletic programs will probably suffer more than 
men’s. As overloaded women in the teaching-coaching profession begin to experience 
burnout or as greater familial demands are placed on them, thus causing greater degrees
of burnout, they are more likely to leave coaching. This will undoubtedly result
in increased difficulties in staffing women’s athletic programs. However, it is important
to note that the point being made here is not just a problem for women; it is a
structural problem which affects women in coaching because of the general societal
expectation that they play the major role in domestic labor.

TABLE 2

Frequency and Percentage Distribution of Coaches
Across the Eight Phases of Burnout

Phase          Frequency    Percentage
Phase I           15           20.0
Phase II           7            9.3
Phase III          6            8.0
Phase IV           7            9.3
Phase V            7            9.3
Phase VI           6            8.0
Phase VII          7            9.3
Phase VIII        20           26.6
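
The percentages in Table 2 can be cross-checked with simple arithmetic. The sketch below assumes that the frequency for each of Phases II, IV, V, and VII is 7, which is the value implied by a share of 9.3% when 15 coaches make up 20.0% of the sample; the variable names are illustrative only.

```python
# Frequencies per phase; the four entries of 7 are inferred from the
# reported 9.3% share and are therefore an assumption, not a quoted value.
frequencies = {
    "I": 15, "II": 7, "III": 6, "IV": 7,
    "V": 7, "VI": 6, "VII": 7, "VIII": 20,
}

total = sum(frequencies.values())

# Percentage of the sample in each phase.
percentages = {phase: 100 * count / total for phase, count in frequencies.items()}

# Share of coaches in the upper four phases (V to VIII).
upper_share = 100 * sum(frequencies[p] for p in ("V", "VI", "VII", "VIII")) / total
```

Under these assumptions the upper four phases hold 40 of the 75 coaches, which agrees with the statement that just over 53% of teacher coaches fall in those phases.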

Another biographic factor which has been shown to influence the level of burnout 
is age. In this study the average age for coaches in Phase I of burnout was approxim- 
ately five years older than those in Phase VIII. Similarly, the number of years coached 
at the secondary school level revealed that the mean for coaches in Phase VIII was 6.9 
years as compared to 11.3 years for coaches in Phase I. The results of this study would 
appear to support Maslach and Jackson’s (1981b) findings that younger and less experienced
workers are more prone to burnout. The younger coaches in Phase VIII reported
factors such as being overly idealistic, overloaded, and overcommitted, a lack of confidence
due to inexperience, and a lack of security in their job as possible contributors to burnout.
These factors are similar to those reported by other researchers in their studies of
burnout (cf. Cherniss, 1983; Maslach, 1982; Pines & Aronson, 1981). 

Several of these researchers have also indicated that the ability to recognize one’s 
limitations and capabilities reduces the likelihood of burnout. Freudenberger and 
Richelson (1980) suggested that the greatest protection against burnout is self-aware- 
ness and recognizing the need for balance. This often comes with experience. A more 
experienced coach in one of the lower levels of burnout exemplified this when he 
noted, “When I first got into coaching I always felt that I had to prove myself as a 
coach and prove my team to others. As time progressed, I quickly dispelled that and 
... [became] more realistic.”

Older, more experienced coaches appear to know their limitations and have at- 
tempted to reduce their commitment to coaching over the years until they have 
reached a compromising balance. In the past, teacher coaches could reduce their com- 
mitment to the number of teams they had to coach, as there were always younger 
teachers moving into the system who were eager to fill these vacant coaching posi- 
tions. However, teacher coaches today are finding themselves in situations where they 
cannot reduce or even take a break from coaching because there are few young “relief” 
coaches entering the school system. They are faced with continuing to coach for an ex- 
tended period and thus are caught in an environment which makes them more 
susceptible to burnout. 

In addition to certain biographic characteristics, a number of school-related factors
were examined in order to determine whether they influenced burnout. One of
the factors which was examined was the size of the schools in which the teacher 
coaches worked. It was found that 43.7% of respondents who taught and coached in a 
school of under 1000 pupils fell into phases VII and VIII of burnout, whereas the cor- 
responding percentage of teacher coaches in these phases in schools with over 1000 
pupils was 22%. From these figures it would appear that teacher coaches from smaller 
schools reported higher degrees of burnout. It is feasible to suggest that because there 
are fewer staff members teaching in smaller schools, the staffing of interschool sports 
programs is made more difficult. Therefore, if smaller schools attempt to offer the 
same number of competitive teams as larger schools, the teacher coaches will be in an 
overload situation which can lead to burnout. 

Another school-related factor which was seen to contribute to burnout was a lack 
of support by the school administrators for the interscholastic athletic programs. 
Among the coaches interviewed, those in Phase VIII of burnout reported lack of 
school-related support more frequently than coaches who were in Phases I and IV. 
These findings correspond to those reported in the literature which indicate that in 
organizations where support is provided, lower levels of staff burnout tend to be found 
(cf. Cherniss, 1980; Ellenburg, 1981; Maslach, 1982). They also support the findings of 
Ratsoy, Sarros, and Aidoo-Taylor (1986, p. 275) who, in their study of district based 
administrators, principals and assistant principals, and nonadministrative school 
based personnel reported “unsatisfactory relationships with superordinates” as a fac- 
tor which caused stress in these individuals. Administrative support, or lack of it, was 
seen by the teacher coaches as being manifest in the school policies related to 
interschool athletics. Specifically, the lack of clear school policies regarding the role of 
interschool athletics was far more frequently reported by individuals in Phase VIII of 
burnout than by those in Phases I or IV. The lack of clear policy was seen to lead to 
role ambiguity as teacher coaches were unsure of the importance they should give to 
interschool athletics vis-a-vis their classroom responsibilities. Both Farber (1983) and 
Cherniss (1980) have reported that this role ambiguity can lead to high levels of burn- 
out. More recently, Ratsoy, Sarros, and Aidoo-Taylor (1986, p. 274) reported role 
ambiguity to be a factor causing stress in both administrative and nonadministrative 
school based personnel. 

Other forms of school-related nonsupport seen as contributing to burnout were 
the lack of compensation for coaching, the lack of recognition, and the lack of an equi- 
table reward system. Of the teacher coaches in the upper phases of burnout (V to 
VIII), 62.5% did not receive any compensation for coaching. However, very few 
wished to see coaches paid for their services. It was the general feeling that there were 
other means of compensation which were more appropriate, the most frequently cited 
being time off from regular teaching duties. Coaches at all phases of burnout indicat- 
ed that inequitable reward and recognition systems placed stress on them. Many 
suggested that teacher coaches who were involved in extracurricular activities should 
be given special consideration in the delegation of other out-of-classroom duties. 


Conclusion 
This study was concerned with investigating the degree of burnout in secondary 
school teacher coaches and identifying factors which may contribute to this burnout. 
It was found that teacher coaches experienced low to moderate ranges of burnout 
when compared to Maslach’s normative distribution scores. Teacher coaches reported 
a moderate amount of Depersonalization and Emotional Exhaustion but a greater 
sense of Personal Accomplishment than Maslach and Jackson’s (1981b) population. 
This was attributed to the positive relationship between the teacher coaches and their 
athletes and the direct feedback on job performance provided by the coaching situa- 
tion. 

When the results from the teacher coaches were categorized according to 
Golembiewski’s Phase Model, it was found that teacher coaches generally experienced 
fewer feelings of Depersonalization and greater feelings of Personal Accomplishment,
thus contributing to lower burnout scores than Golembiewski’s (1983b) sample. 
Teacher coaches did, however, score higher on Emotional Exhaustion. Nevertheless, 
although teacher coaches generally experienced less burnout than Maslach’s and 
Golembiewski’s samples, over 53% fell into the higher levels of burnout. 

The findings of this study do support Golembiewski’s phase model approach to
burnout as a useful instrument for helping to determine the precursors of burnout as 
experienced by a particular occupational group. In the sample of teacher coaches 
studied in this research, a number of factors were shown to be associated with the 
upper levels of burnout. Although further research is needed before causality could be 
established, the present research does help identify variables which are a potentially 
useful starting point for work of this nature. 

Although the results of this study support the usefulness of Golembiewski’s 
phase model in helping to determine precursors of burnout, they show less support
for the prepotencies of the subscales as proposed in his model. These prepotencies are
based on the premise that increased feelings of Depersonalization are seen as leading to
decreased feelings of Personal Accomplishment which in turn result in increased 
Emotional Exhaustion. Based on this progression, it is difficult to explain the mean 
subscale scores reported in this study as compared to those established by 
Golembiewski. With the lower scores on Depersonalization and Personal Accomplish- 
ment, one would expect this sample to have also reported a lower mean score than 
Golembiewski’s sample on the Emotional Exhaustion subscale. It is the teacher 
coaches’ higher score on Emotional Exhaustion which does not support 
Golembiewski’s prepotencies of the three subscales. There are a number of possible 
explanations for this phenomenon. These include the potential of a Social Desirability
Effect and the possibility that other work-related sources may contribute more to teacher-coach
burnout than coaching itself. These possibilities require further study.

In addition to the above comments on Golembiewski’s phase model, there are two 
other points emanating from this study that relate to the model and are worthy of 
note. The first point is methodological in nature and relates to the reversing of the 
scoring system for the items in the Personal Accomplishment subscale. Although this 
procedure is necessary in order to be able to properly establish the phase model, it is 
somewhat confusing, particularly when trying to compare results with Maslach’s 
norms which have been scored in the opposite manner. The second point worthy of 
mention relates to the dichotomizing of the MBI subscale scores to rank individuals as 
either high or low. This convention may not be sufficiently discriminating, and as 
Golembiewski, Munzenrider, and Carter (1983, p. 480) note, until norms for burnout 
phases are developed, this convention should be considered as convenient rather than 
definitive. The sequencing of the low and high scores into the eight phases of burnout 
(see Table 1) also raises some questions. This sequencing emanates from the
prepotencies of the subscales; based on the questions raised in this study about these
prepotencies, the high/low sequencing in each of the phases is worthy of additional
study.

Notwithstanding these points, the Golembiewski phase model does, as mentioned 
above, represent an extremely useful tool for identifying factors relating to higher 
levels of burnout. In this study, the gender of the individual, their age, the size of the 
school in which they coached and taught, the amount of school administration sup- 
port for coaching, and the compensation, recognition, and rewards for coaching were 
all shown to be linked to the higher levels of burnout. 


Information Integration Theory Extended: 
Cognitive Coding in Young Children 


A set of tests for simultaneous and successive coding was constructed and administered 
to a sample of young children (Grades Kindergarten-2). The objectives of the study 
were to examine the consistency of the coding factors and to provide tests which would 
be sensitive to children’s development. The tests were constructed to reflect a range of 
input and output demands including both sensory and behavioral components. Results 
of the factor analysis clearly validated the two coding factors and a factor of response 
speed. A MANOVA was then performed to determine which increments in test performance
could be attributed to age differences. The results are discussed in terms of
the potential of these tasks to identify cognitive processing problems in young children. 


Efforts to move away from traditional psychometric approaches when examining cog- 
nition have relevance with young children, particularly when one is attempting to pre- 
dict later academic progress. The expedience of attending to cognitive functioning, 
and potential problems in the latter, when children are at preschool or beginning 
school ages has been discussed by Latorne (1985). Neurologically, the child at this age 
may be conceived of as more “plastic,” with a better prognosis for central nervous sys- 
tem adjustments (Satz, Friel, & Rudegeair, 1976). Psychologically, the younger child 
will be less likely to have suffered a loss of self-esteem due to failure academically 
(Green, Lyles, & Eissfeldt, 1980), and financially a child needing assistance may re- 
ceive it prior to a need for more intensive and expensive attention (James, 1982). 

Greenfield and Scott (1985) have outlined a critique of current screening devices
for preschool children, indicating that these devices have typically been based on intelligence
test content, and that such tasks reflect a general problem solving ability that is
not likely to identify children with specific cognitive processing deficits. In the
opinion of these authors, “Unless a preschool screening test assesses active cognitive
processing, children who have deficits in particular cognitive areas will not be identified”
(p. 44). Greenfield and Scott propose the alternative of “game-like cognitive
tasks.”

Cognitive deficits must be defined within the context of a theory of cognitive or in- 
tellectual functions. Even when the theory is not explicitly mentioned, it is implicit in 
any system of classification. When we classify abilities or disabilities, we operate from 
certain theoretical assumptions about cognitive abilities. Therefore, screening tests 
for cognitive processing should be chosen or constructed in accordance with a theory. 

Information integration theory provides such a process-based approach to cogni- 
tion (Das, Kirby, & Jarman, 1975; 1979). Based on the work of Soviet psychologist 
Luria, the proposition is put forward that the brain employs two mutually dependent 
ways of coding information and that these, along with an organizational “planning” 
component, underlie all forms of cognitive processing. Simultaneous coding involves 
the synthesis of various separate elements into groups, with spatial overtones such 
that all parts of the group are accessible regardless of their position in the group. 
Simultaneous processing is occurring when we glance at a painting and survey it as a 
whole, integrating all of the spatial relationships, or when we listen to an orchestra 
and all of the rhythms, pitches, and tones are integrated into a holistic music scheme 
or unit. Such a synthesis of relationships is necessary for more academically complex 
related tasks such as the comprehension of numerical and geometrical concepts, the 
understanding of the grammatical system of a language, and the comprehension of 
written prose. 

Successive synthesis involves the integration of separate elements into series or 
groups that are related temporally rather than spatially. Luria described such ele- 
ments as being integrated in a chain, with each link activating the next link in a con- 
tinuous but serial pattern. Successive coding is reflected in activities such as decoding 
print, producing a series of written letters, memorizing a list of items, and producing
or interpreting narrative speech. 

Planning ability appears to be somewhat superordinate to the coding functions, 
and involves the generation and execution of programs, and the use of foresight and 
flexibility. This is apparent, for example, when a child demonstrates strategic intent 
within a task, such as the planning and organization of a written composition. 

Obviously the coding and planning functions are implicated in most academic 
learning and classroom activities. Each child is viewed as having the simultaneous and 
successive coding processes and planning strategies available for use in learning to a 
greater or lesser degree, depending on the individual’s own processing abilities and 
preferences, and also on the particular task demands. 

Over the past two decades, Das and his colleagues have been involved in successful 
efforts to operationalize the concepts of simultaneous and successive coding and plan- 
ning, and validate respective marker tasks across samples divergent in age, IQ, learn- 
ing ability, and cultural background (where “marker task” implies those which are 
differentiated clearly and in line with theoretical predictions for the two types of cod- 
ing and planning within a factorial analysis, Das, 1972; 1973; 1985; Das & Molloy, 
1975; Snart & Swann, 1981). 

One recent attempt to incorporate the essence of information processing theory 
into a practical assessment battery for children has been the K-ABC (Kaufman As- 
sessment Battery for Children, 1983). The K-ABC includes sets of subtests to meas- 
ure simultaneous processing, successive processing, and achievement or school related 
skills. Drs. Kaufman were both involved with the development of the McCarthy 
Scales and the WISC-R, and acknowledge the centrality of information integration to 
the development of the K-ABC: “Factor analysis and coefficients of correlation with 
Das, Kirby & Jarman’s (1975; 1979) battery of successive and simultaneous tests offer 
strong support for the construct validity of the processing dichotomy underlying the 
K-ABC Mental Processing Scales” (Kaufman & Kaufman, 1983, p. 3). Thus the intent 
of the K-ABC was indeed to move away from a post hoc analysis of mainly quantita- 
tive analyses of children’s performance and to incorporate a dimension of ongoing 
processing capabilities. 

Since it is a recent addition to the domain of assessment, it is natural that the 
K-ABC undergo a critical analysis in the literature. Though it has been described pos- 
itively as “perhaps the first standardized battery of intelligence tests that attempts to 
measure processes” (Das, 1984c, p. 229), there is also a caution on the same page that 
the Kaufmans have “initiated but not completed the job of constructing a process- 
based test of cognitive competence.” More details of the latter suggestion are provided 
elsewhere (Das, 1984c; 1984d). The K-ABC apparently has something to offer, partic- 
ularly in the sense of increasing our breadth of understanding of children’s achieve- 
ment or underachievement in school. For children at beginning school ages, the issue 
appears somewhat equivocal, and increased documentation of the potential for information
integration theory and assessments to be “predictive” of school performance is
necessary. The development of tasks for young children in addition to those of the
K-ABC, though also administered and analyzed in terms of processing strategy rather
than quantitative score per se, is therefore part of this continuing effort. Tasks similar
to those used traditionally as ability measures, such as the xylophone and picture
memory tasks of the McCarthy Scales, but administered and analyzed in this new
light, are appropriate inclusions within such a battery.

It is apparent from research reported in the K-ABC Interpretive Manual and by 
Das (1984a) that the coding processes develop essentially in a hierarchical fashion, 
though continued exploration seems to indicate that successive processes may be 
utilized earlier than simultaneous ones. If it is possible to validate further marker 
tasks for simultaneous and successive coding on populations of young children, which 
is the main purpose within the present study, and follow a sample of students to de- 
termine the predictive value of the tasks, then the benefits obviously exceed even the 
knowledge of correlations between processing strengths and weaknesses and actual 
school performance later. Students’ performance on the present tasks will comprise a 
set of baseline data from a group of young children prior to the identification of problems
in achievement. A longer term follow-up relating strengths and weaknesses in
coding to academic performance will then be possible.

The objectives of the present study therefore were: 

1. To provide a further extension of the information integration approach to cogni- 
tion to children as young as five years of age. 

2. To develop a sample of appropriate and motivating marker tasks for simultaneous
and successive coding for children ranging in age from five to about eight years. 

3. To assess the consistency of factor analytic results with young children, with those 
established for older students and adults, and thus examine the validity of the 
marker tasks. 

4. To provide some initial comments as to a relationship between chronological age 
and the development of simultaneous and successive coding. 


Method 

Subjects 
Students participating in the present study were members of two classrooms at each 
of kindergarten (N=48), grade 1 (N=43) and grade 2 (N=44). The classrooms represented
the total number of English speaking classes at each grade level in an
Edmonton school which also offered French immersion as an alternative. Since the 
school population included students from a middle class neighborhood, as well as 
those who were bussed from lower socioeconomic neighborhoods, the range of stu- 
dents sampled was representative of the city’s population of young school children. 

Parent permission was sought prior to assessing the children, and in no case were 
we denied permission. Thus we were able to access all students in the six identified 
classrooms. Testing was carried out in a quiet room at the school and the order of test 
administration was counterbalanced across students. Each child was seen by one of 
the authors (Barriault). She was able to develop excellent rapport with the young chil- 
dren, who were seen individually for testing. The average time per assessment was 35 
minutes. The researcher was in the school over the course of six weeks to complete 
testing. 


Coding Tasks 

An attempt was made to create tasks that would be appealing to children in the early
grades, tap a range of sensory and behavioral areas in terms of input and output
demands, and reflect a coding dimension in either the simultaneous or successive domain.
It is noteworthy that care was taken during task development to avoid an alignment
of specific cognitive demands with potential marker tasks for either
simultaneous or successive coding. For example, we included tasks requiring memory
ability, visual-motor responses, and nonverbal responses in both sets of marker tests.
Input for specific tasks in each section involved both vision and audition. Thus a
potential confounding variable for the interpretation of results was significantly reduced.
Three tasks were developed as potential marker tasks for successive coding,
based on previously known marker tasks for older children (Brailsford, Snart, & Das,
1984), and three were developed for simultaneous coding.

The following tasks were predicted to be successive marker tasks: 

1. Imitation of Rhythm (Xylophone). The child is required to reproduce a series of 
progressively more difficult sequences on the xylophone. Series range in length 
from three to seven notes, and the final score consists of the total number of cor- 
rect taps. This task provides auditory, visual, kinesthetic, and tactile input, and 
requires visual-motor and auditory-motor output. 


2. Moveable Sequences. The child is required to reproduce various series of pictures
by sliding cards through a frame apparatus until they match the researcher’s
model. Input here is visual and output is visual-motor.


3. Identification from Memory. This task involves presentation of a number of series 
of familiar pictures to the child which are then covered, with only the bottom of 
each picture frame remaining in view. In each case the child is then asked to iden- 
tify certain cards by pointing. There are 10 trials, involving series from three to 
eight pictures in length, and the score consists of the number of pictures correctly 
recalled. Input is visual and output is visual-motor. 

The following tasks were predicted to be marker tasks for simultaneous coding: 


1. Memory for Design (Turtles). Similar to the adult version of this task, the child is 
required to draw from memory a series of six simple line drawings. To heighten 
motivation the designs are presented and reproduced on the blank shell of a draw- 
ing of a turtle. Scoring guidelines are patterned after those used with adults, such 
that for the seven items, each could receive a score of 0, 1, 2, or 3 according to the 
maintenance of correct proportions and relationships, rather than exactitude of 
detailed replication. This is particularly important because even with these simple 
drawings one would not want the emphasis on visual-motor coordination per se. 
Input is visual and output is visual-motor. 


2. Silhouette Match. The child is required to match 10 concrete silhouettes with 
their partially outlined counterparts in a gameboard format. Following a trial item, 
the child is timed while sorting the cards. Scoring consists of the sorting time 
and the number sorted correctly. Input is visual and output is visual-motor. 


3. Reveal a Section Puzzles. The child guesses the content of a series of pictures as 
he or she is presented with puzzle pieces of each picture in a cumulative manner. 
Input is visual and output is verbal. 


Results 

Results on the coding tasks for the total number of students (N=130) were submitted 
to a principal components analysis followed by Varimax rotation, with eigenvalues 
greater than one used as the criterion for rotation (see Table 1). Such rotation resulted 
in the emergence of three factors representing simultaneous coding, successive cod- 
ing, and response speed. These three factors have been outlined in earlier studies 
(Das, Kirby, & Jarman, 1975), but the tests used were different; thus the factors seem 
to have generality across different test batteries. 

To examine the homogeneity of the correlations for the coding tasks at 
kindergarten, grade 1 and grade 2, these three correlation matrices were compared 
using the asymptotic chi-square test of equality (Jennrich, 1970). This test indicated 
no differences between the matrices across the three groups. 

A multivariate analysis of variance procedure (Morrison, 1967; Finn, 1974) was 
used to examine the marker task scores for simultaneous and successive processing 
across the three age groups. The speed scores were not included because these are not 
central to the information integration theory. The Wilks' Lambda criterion indicated 
significant differences between the groups (F=8.94, df=12/244, p<0.001). A follow-up 
was performed on the individual pairs of means using simultaneous confidence 
intervals, and the results are summarized in Table 2. 
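
The degrees of freedom reported for the Wilks criterion can be checked against the design. The following is a minimal sketch, assuming six marker-task scores (the eight variables of Table 1 less the two time scores), three groups, and N=130; with exactly three groups the F transformation of Wilks' Lambda is exact. The function names are illustrative, and the Lambda value is backed out from the reported F rather than taken from the paper.

```python
import math

def wilks_f_df_three_groups(n_total, n_vars):
    """Degrees of freedom of the exact F for Wilks' Lambda with three groups."""
    df_error = n_total - 3                 # error df in a one-way design, 3 groups
    df1 = 2 * n_vars
    df2 = 2 * (df_error - n_vars + 1)
    return df1, df2

def wilks_f_three_groups(lam, n_total, n_vars):
    """Exact F statistic implied by a given Wilks' Lambda (three groups only)."""
    root = math.sqrt(lam)
    df_error = n_total - 3
    return (1 - root) / root * (df_error - n_vars + 1) / n_vars

def lambda_from_f(f_value, n_total, n_vars):
    """Invert the transformation: the Lambda implied by a reported F."""
    ratio = f_value * n_vars / (n_total - 3 - n_vars + 1)
    return (1 / (1 + ratio)) ** 2

print(wilks_f_df_three_groups(130, 6))     # (12, 244)
print(round(lambda_from_f(8.94, 130, 6), 3))
```

Under these assumptions the computed degrees of freedom agree with the 12/244 reported in the text.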

The latter results are reported in order to establish the sensitivity of each test to 
developmental differences. Since the adjacent groups of children were very close in 
age, most of the comparisons did not show significant differences between K and 
Grade 1, and between Grades 1 and 2, even though there was a significant develop- 
mental trend as evidenced from the MANOVA results. The four simultaneous tests, 
now including the Xylophone task, were examined first. Memory for Designs 
(Turtles) was the “best” test: All adjacent groups differed significantly. Xylophone 
scores showed significant developmental increments from Grades 1 to 2, but not from 
K to Grade 1. In Silhouette Matching, scores increased significantly from K to Grade 
2. Reveal a Section was the most disappointing; none of the group differences was sig- 
nificant. Among the successive tasks, Moveable Sequences scores were significantly 
greater for Grade 2 than for K; no other differences were significant. ID memory 
scores did not show significant increments. 

TABLE 1 

Orthogonal (Varimax) Rotations of 
Three Factors Following Principal Component Analysis* 
(N=130) 

                                                          Orthogonal Factors 
Variable                       Mean            SD           Simultaneous  Successive   Speed        h² 
Xylophone                      30.68           5.15         [illegible]   081          016          632 
Moveable Sequences             20.02           3.39         [illegible]   747          104          646 
Identify from Memory           9.24            [illegible]  033           844          [illegible]  743 
ID from Memory (time)          42.72           17.14        166           [illegible]  182          568 
Silhouette Match               4.89            2.02         656           -258         301          588 
Silhouette Match (time)        142[illegible]  [illegible]  -090          -001         805          655 
Reveal a Section               10.67           1.72         676           027          -138         477 
Memory for Designs (Turtles)   7.80            4.09         816           178          [illegible]  [illegible] 
% Total variance                                            45.68         27.45        26.87 
*decimals omitted from factor loadings 
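
As a consistency check on Table 1, each communality (h²) should equal the sum of the squared loadings across the three orthogonal factors. The short sketch below uses three rows whose loadings are fully legible, with the decimal points restored; the helper name `communality` is illustrative.

```python
# Loadings from Table 1 (simultaneous, successive, speed) and the printed h²
rows = {
    "Silhouette Match": ((0.656, -0.258, 0.301), 0.588),
    "Silhouette Match (time)": ((-0.090, -0.001, 0.805), 0.655),
    "Reveal a Section": ((0.676, 0.027, -0.138), 0.477),
}

def communality(loadings):
    """Sum of squared loadings of one variable over orthogonal factors."""
    return sum(x * x for x in loadings)

for name, (loadings, reported) in rows.items():
    print(f"{name}: computed {communality(loadings):.3f}, reported {reported:.3f}")
```

The computed and printed values agree to within rounding of the three-digit loadings.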

TABLE 2 

Summary of the Multiple Comparisons for Task Means 

                                                       GROUP 
                         Total 
                         Possible      Kindergarten   Grade 1   Grade 2 
                         Score         (N=45)         (N=44)    (N=46) 
Moveable Sequences       [illegible]   18.44          20.00     21.59 
Identification 
  From Memory            18            8.56           9.56      9.59 
Silhouette Match         8             4.05           4.81      5.79 
Reveal a Section         16            [illegible]    10.81     [illegible] 
Memory for Designs 
  (Turtles)              18            4.61           7.65      11.07 
Xylophone Task           43            [illegible]    30.37     34.38 

Note: A line under two or more means for a given task indicates that those means do not differ at 
the .05 level; that is, the confidence interval for such a pair of means contains zero. 

Discussion 

The factor patterns which emerged following the overall factor analysis were in line 
with the authors’ expectations and past research with older children and adults (Das, 
1984a, 1984b). The only task that had not been correctly predicted was the Xylophone 
marker task for successive coding, which actually loaded with the simultaneous tasks 
within the factor matrix. The latter finding may be understood and justified on a post 
hoc basis, with further attention to the actual task demands. When children were 
asked to replicate sequences on the Xylophone, we assumed that the cognitive coding 
in order to respond would necessarily involve successive processes. However, it ap- 
pears that in many of the earlier items children could also have responded on the basis 
of the total visual pattern of the sequence or grouped them in simultaneous chunks 
(e.g., every second bar on the xylophone). This holistic response would have altered 
the task demands cognitively such that they would fall into the domain of simulta- 
neous coding. As introduced earlier in the paper, the coding strategies accessed for 
each task are influenced not only by the child’s available and preferred strategies, but 
also by the task design and demands. 

Apart from the Xylophone task, there is clearly support for the validity of the 
marker tasks for young children as they are capable of reflecting successive and simul- 
taneous coding. Alterations in difficulty levels for some of the tasks and slightly more 
sophisticated scoring criteria will be developed as research with these tasks proceeds 
to construct a viable screening battery. However, even in the present form, it is evi- 
dent that the marker tasks are motivating, age-appropriate, and worthy vehicles to 
provide a further extension to information integration theory. 

The mean scores presented in Table 2 indicate a predictable improvement in the 
actual performance on all the coding tasks as children developed chronologically. 
Since there were no difficulties with either floor or ceiling effects for any of the tasks, 
it appears that the task demands represented a reasonable range of difficulty for chil- 
dren at these ages. What remains to be examined with particular interest is the rela- 
tive strength and growth of each type of cognitive coding in students who are 
particularly successful, or unsuccessful, academically. 

With the data available to this point, it is reasonable to speculate that the infor- 
mation integration approach indeed has potential for use with young children. It can 
be used to provide an early screening device in school, and consequent to screening, 
the model can lead to remedial measures to help prevent academic underachieve- 
ment. Cognitive deficit in either simultaneous or successive processing can be open to 
remediation. Appropriate remedial programs based on the model have been success- 
ful in improving reading (cf. Brailsford, Snart, & Das, 1984) in older children. Com- 
parable programs for kindergarten children have the potential to help prevent 
development of reading and arithmetic disabilities. 


Notes 


1. This research was supported by a grant from the Central Research Fund, University of Alberta. 
2. More detailed information about the assessment tasks may be obtained from the authors, c/o Developmental Disabilities 
Centre, 6-123D Education North, University of Alberta, Edmonton, Alberta, T6G 2G5. 




F. Snart, J.P. Das, and B. Barriault 


References 


Brailsford, A., Snart, F., & Das, J.P. (1984). Strategy training and reading comprehension. 
Journal of Learning Disabilities, 17, 287-290. 

Das, J.P. (1972). Patterns of cognitive ability in nonretarded and retarded children. American 
Journal of Mental Deficiency, 77, 6-12. 

Das, J.P. (1973). Cultural deprivation and cognitive competence. In N.R. Ellis (Ed.), 
International review of research in mental retardation (Vol. 6). New York: Academic Press. 

Das, J.P. (1984a). Intelligence and information integration. In J. Kirby (Ed.), Cognitive 
strategies and educational performance (pp. 13-31). New York: Academic Press. 

Das, J.P. (1984b). Aspects of planning. In J. Kirby (Ed.), Cognitive strategies and educational 
performance (pp. 35-50). New York: Academic Press. 


Das, J.P. (1984c). Simultaneous and successive processes and K-ABC. Journal of Special 
Education, 18(3), 229-238. 

Das, J.P. (1984d). Review of Kaufman Assessment Battery for Children (K-ABC). Journal of 
Psychoeducational Assessment, 2, 83-88. 


Das, J.P. (1985). Remedial training for the amelioration of cognitive deficits in children. In 
A. Ashman & R. Luria (Eds.), The education and training of the mentally retarded 
(pp. 215-244). London: Croom Helm. 


Das, J.P., Kirby, J.E., & Jarman, R.F. (1975). Simultaneous and successive synthesis: An 
alternative model for cognitive abilities. Psychological Bulletin, 82(1), 87-103. 


Das, J.P., Kirby, J.E., & Jarman, R. (1979). Simultaneous and successive cognitive processes. 
New York: Academic Press. 


Das, J.P., & Molloy, G.N. (1975). Varieties of simultaneous and successive processing in children. 
Journal of Educational Psychology, 67, 213-230. 


Das, J.P., & Varnhagen, C.K. (1986). Neuropsychological functioning and cognitive processing. 
In G. Hynd & J. Obzurt (Eds.), Child neuropsychology (Vol. 2, pp. 117-140). New York: 
Academic Press. 


Finn, J.D. (1974). A general model for multivariate analysis. New York: Holt, Rinehart & 
Winston. 


Green, C., Lyles, C., & Eissfeldt, A. (1980). Kindergarten diagnosis and training: A program 
report. In D.J. Sawyer (Ed.), Disabled readers: Insight, assessment, instruction (pp. 50-56). 
Newark, DE: International Reading Association. 


Greenfield, D.B., & Scott, M.S. (1985). A cognitive approach to preschool screening. Learning 
Disabilities Research, 1(1), 42-49. 


James, J.M. (1982). Evaluating early identification programs. Ontario Psychologist, 14, 31-33. 


Jennrich, R.L. (1970). An asymptotic test for the equality of two correlation matrices. Journal of 
the American Statistical Association, 65(30), 10-15. 


Kaufman, A.S., & Kaufman, N.L. (1983). Kaufman Assessment Battery for Children— 
Administration and scoring manual. Circle Pines, MN: American Guidance Service. 


Latorne, R.A. (1985). Kindergarten screening: A cross validation of the Florida Kindergarten 
Screening Battery. Alberta Journal of Educational Research, 31(3), 174-190. 


Morrison, D.F. (1967). Multivariate statistical methods. New York: McGraw-Hill. 


Satz, P., Friel, J., & Rudegeair, F. (1976). Some predictive antecedents of specific reading 
disability: A two-, three- and four-year follow-up. In J.T. Guthrie (Ed.), Aspects of reading 
acquisition (pp. 111-140). Baltimore: Johns Hopkins University. 

Snart, F., & Swann, N. (1981). Cognitive processing and employability, Stage 1. Journal of 
Practical Approaches to Developmental Handicap, 4(3), 14-22. 




The Alberta Journal of Educational Research Vol. XXXIII, No. 4, December 1987, 283-291 


ANNA THOR 


and 


RUTH E. BERRY 


University of Manitoba 


A Canadian Assessment of 
Consumer Education Knowledge 
of Prospective Teachers 


This study examined the factors affecting the knowledge of prospective teachers in con- 
sumer studies, as measured by a test of consumer knowledge. The data were collected 
from 187 prospective teachers in their final year of study at both Brandon University 
and the University of Manitoba. Chi-square analysis revealed that prospective teachers 
who were older, had more experience in consumer matters, had taken consumer related 
coursework, and who felt competent to teach in this area scored significantly higher on 
the test of consumer knowledge than comparative groups. 


As society in general and the marketplace in particular have become more complex, 
the need for consumers to be well informed has increased dramatically. The public 
school system has been considered an appropriate arena for transmission of this infor- 
mation in the form of consumer education courses. Administrators have acknowl- 
edged the significance of consumer education as a subject that provides the seeds for 
teaching students the life skills they require to live and function successfully in our 
complex society. The result has been that the status of consumer education has risen 
to a level that warranted its integration into the school system curriculum. But are the 
teachers knowledgeable in this subject matter area? 

The term consumer education has been defined by many authors. Dalaba (1978) 
defined consumer education as “the development of the individual in skills, concept, 
and understandings required for everyday life to achieve, within the framework of his 
own values, maximum utilization of and satisfaction from his resources” (Brown, 
1983, p. 205). Willett (1979) stated that “Consumer education is an area of study 
which equips individuals and groups with the knowledge and skills to make effective 
choices and take action regarding the use and conservation of available resources in 
the public and private sectors consistent with individual values and societal needs” 
(Bannister & Monsma, 1979, p. 6). In a Manitoba Department of Education publication, 
consumer education is defined as “the process of developing skills and understandings 
in the considered use of personal and environmental resources to responsibly 
meet human needs and wants” (1977, p. 1). Essentially, each definition states the 
same three elements with which consumer education should equip pupils: 
(a) knowledge, (b) values, and (c) skills for making consumer decisions. 

There are several major factors which have influenced the effectiveness of con- 
sumer education programs in the schools. One of these is that consumer education as a 
subject area has not been in existence long enough for all the problems to be identified 
and clarified. Considering the way consumer education is defined, one recognizes that 
it potentially captures a wide range of areas and problems associated with everyday 
living such as decision-making related to personal money management, energy con- 
servation, food buying, advertising, consumer laws, and numerous other areas. In light 
of the magnitude of consumer studies, consumer educators face an arduous task of 
teaching effectively in this particular subject area. Secondly, most educators have had 
very little, if any, training or preparation for teaching consumer education related 
subjects. There is considerable reluctance among educators to participate in teaching 
consumer education, possibly because the educators are ill-prepared in the first place. 
In Manitoba’s case, for instance, the curriculum coordinator in the Department of 
Education stated that teachers designated to teach consumer education feel ineffec- 
tive and incapable of teaching these courses (Parasiuk, personal communication, 
1983). 

Using Bloom’s Taxonomy of Educational Objectives, Handbook I: The Cognitive 
Domain (1956), it becomes possible to understand what may be the cause for teachers’ 
feelings of ineffectiveness and incapability to teach consumer education. Bloom 
identifies a hierarchy of learning that takes place in the cognitive domain. The ele- 
ments in this hierarchy are knowledge, comprehension, application, analysis, 
synthesis, and evaluation such that a learner begins with low level memory skills iden- 
tified as knowledge, and ends up with high level skills involving evaluation (Popham, 
1981). Teaching involves a complex process whereby teachers first need to acquire 
the skills, attitudes, and knowledge needed to teach students effectively. 

This study is concerned with examining whether prospective teachers in Manitoba 
have some level of knowledge (which is the first level of Bloom’s cognitive domain) in 
consumer studies using a minimum competency or achievement test (Ebel, 1980). 
Bloom’s definition of knowledge has been adopted for this study as that which “in- 
volves the recall of specifics or universals, the recall of methods and processes, or the 
recall of a pattern, structure, or setting” (Popham, 1981, p. 329). 


Relevant Literature 
A search of the literature revealed that research in consumer education has concen- 
trated predominantly on high school students. Researchers have probed consumer 
education students with tests and follow-up studies in their quest for determining the 
amount of knowledge students have in consumer matters (Langrehr, 1979; Mayer, 
1981). On the other hand, apparently little emphasis has been given to the examina- 
tion of the knowledge and skills of teachers who are designated to teach consumer 
courses. Davis (1978) and Garman (1979) both conducted an autumn 1977 Education 
Resources Information Centre (ERIC) search which revealed that there were no 
studies specifically dealing with teacher preparation for consumer education. Garman 
found few comprehensive studies attempting to determine the knowledge level of 
educators teaching consumer education. For the purpose of this study, an ERIC 
search was conducted in 1983 and in 1984, and only six studies were found examining 
the competency of consumer education teachers. It seems reasonable to assess 
teachers’ consumer education knowledge as well as the 
consumer knowledge of students. In the words of Davis, “it is highly probable that if 
teachers are better prepared and competent for teaching consumer education, stu- 
dents will become more competent consumers” (Davis, 1978, p. 191). 

Uhl (1970) found that teachers were not sufficiently equipped to teach consumer 
education and there was a need for qualified teachers in consumer education. Davis 
(1978) found that there was a statistically significant difference between those 
teachers who had no courses in consumer education or related subjects and those who 
had some preparation in these areas. The teachers with some preparation were better 
equipped to teach consumer education. Garman (1979), too, discovered in his survey 
of 4,309 subjects that prospective teachers who finished at least one consumer educa- 
tion course scored higher on an achievement test than those without any consumer re- 
lated courses. Carsky, Lytton, and McLaughlin (1984), in their study involving 182 
university undergraduates enrolled in a consumer education class, found that postsec- 
ondary consumer education courses can lead to positive changes in consumer compe- 
tency of these students. Langrehr (1979) also found significant differences in 
consumer knowledge between students enrolled in consumer education classes and 
those in other classes. 

Garman (1979) observed that major and minor areas of study and gender affected 
performance on the general test of consumer knowledge. Participants with majors in 
social studies, science, and home economics (human ecology) scored higher on the 
test, and participants with minors in home economics and business scored higher than 
those with other minors. Garman also discerned that gender was a possible factor in- 
fluencing consumer knowledge. In his study, males had significantly higher scores on 
the consumer test than did females. 

Besides finding that consumer related courses helped equip teachers to teach con- 
sumer education, Davis (1978) also noted a statistically significant difference between 
teachers who felt competent and those who did not feel competent to teach consumer 
education. Those teachers who had credit courses in consumer studies, workshops, or 
both, did feel more effective and competent to teach. Her findings were that 58.2% of 
600 teachers in Kentucky felt competent to teach, and 32.7% of teachers teaching 
consumer education did not feel competent to teach. As well, 46.4% did not have any 
training in consumer education, and 48.1% did not feel their teacher preparation was 
sufficient. Apparently, according to Davis, if a teacher feels competent the teacher is 
likely to feel prepared to teach. 

Research in the area of teacher preparation in consumer education has dealt little 
with an examination of factors, other than coursework and feelings of competence, 
which are helpful in preparing and indicating the willingness of teachers to teach con- 
sumer education. No research has been found that attempts to examine other factors 
such as age, marital status, and consumer management experiences that may have 
influenced the preparation of teachers. 

Research Design 

Sample 

The sample was composed of 159 participants from the University of Manitoba and 28 
from Brandon University, making a total sample size of 187. It was composed of 96 
males (51%) and 91 females (49%), with the majority of prospective teachers being in 
their early 20s. Respondents between the ages of 20-24 made up 62% of the total sample, 
and those 25 years and over made up the remaining 38%. The majority of the 
participants (67%) were single. All were prospective secondary school teachers in their 
final year of study and had the following majors or teaching electives: business, human 
ecology, mathematics, science, and social studies. The data were collected in 
September 1984. 


Instruments 

An achievement test entitled the Test of Consumer Knowledge (TCK), consisting of 
48 multiple choice items, was administered. It was modified by the researchers 
from tests developed by Stanley (1975) and Dlabay (1984) to ensure suitability in 
Canada and was pretested with a similar group of senior students during the spring of 
1984. The test items were compared for construct validity with curriculum guidelines. 
The TCK was checked for internal consistency reliability using Kuder-Richardson 
Formula 20, and the reliability coefficient was found to be 0.997. 
Stanley’s (1975) Form “A” and Form “B” tests had alternate form reliability coefficients 
of 0.733 and 0.738 respectively (Garman, 1977). Respondents also completed a 
questionnaire designed to: 


1. Determine the number of consumer related courses that have been taken by each 
participant. 


2. Measure some particular life experiences of prospective teachers which include 
childrearing experiences and consumer management experiences (Dlabay, 1984; 
Gordon & Lee, 1972; Jelley & Herrmann, 1973) that may affect performance on 
the Test of Consumer Knowledge (TCK). 


3. Measure the feelings of confidence/competence of prospective teachers for teach- 
ing consumer education courses. 


4. Obtain demographic information regarding age, sex, and marital status. 
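
The internal consistency check described for the TCK, Kuder-Richardson Formula 20, can be sketched as follows. This is a minimal illustration, assuming each item is scored 1 (correct) or 0 (incorrect) and using the population variance of the total scores (some texts use the sample variance, which changes the value slightly). The five-person response matrix is hypothetical and is not data from the study.

```python
def kr20(responses):
    """KR-20 for a list of examinees, each a list of 0/1 item scores."""
    n = len(responses)
    k = len(responses[0])
    totals = [sum(person) for person in responses]
    mean_total = sum(totals) / n
    # Population variance of the total scores
    var_total = sum((t - mean_total) ** 2 for t in totals) / n
    # Sum over items of p * q, where p is the proportion answering correctly
    pq = 0.0
    for j in range(k):
        p = sum(person[j] for person in responses) / n
        pq += p * (1 - p)
    return k / (k - 1) * (1 - pq / var_total)

# Hypothetical responses: 5 examinees by 4 items
example = [
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 0],
]
print(round(kr20(example), 3))  # 0.8
```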


Data Analysis 

For descriptive purposes a series of cross-tabulations was made using categorized 
data. The dependent variable, score on the TCK, was examined using the chi-square 
statistic, and in some cases gamma was used to assess any relationships with 
the independent variables (Goodman & Kruskal, 1979; Mendenhall, 1979). The 
significance level for chi-square was set at p=.05 for testing the hypotheses in this study. 
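
The two statistics named above can be sketched in plain Python. This is an illustration only, not the authors' procedure: the Pearson chi-square compares observed and expected counts, and the Goodman-Kruskal gamma is computed as (C - D) / (C + D) from concordant and discordant pairs in a table whose rows and columns are both ordered. The 3 x 3 counts are hypothetical.

```python
def chi_square(table):
    """Pearson chi-square statistic and degrees of freedom for a table of counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            stat += (observed - expected) ** 2 / expected
    df = (len(table) - 1) * (len(table[0]) - 1)
    return stat, df

def goodman_kruskal_gamma(table):
    """Gamma for an ordered table: (concordant - discordant) / (concordant + discordant)."""
    concordant = discordant = 0
    n_rows, n_cols = len(table), len(table[0])
    for i in range(n_rows):
        for j in range(n_cols):
            for k in range(i + 1, n_rows):       # only rows below, so each pair counts once
                for m in range(n_cols):
                    if m > j:
                        concordant += table[i][j] * table[k][m]
                    elif m < j:
                        discordant += table[i][j] * table[k][m]
    return (concordant - discordant) / (concordant + discordant)

# Hypothetical counts: rows = low/moderate/high experience,
# columns = low/moderate/high test score
counts = [[40, 25, 15], [12, 33, 26], [3, 10, 22]]
stat, df = chi_square(counts)
print(f"chi-square = {stat:.3f}, df = {df}")
print(f"gamma = {goodman_kruskal_gamma(counts):.3f}")
```

Gamma ignores tied pairs, which is why it suits cross-tabulations of grouped, ordered categories such as these.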


Results and Discussion 

The test scores on the 48-item Test of Consumer Knowledge were converted 
to percentages. In general terms, the test results can be subdivided into three 
groups of achievers: low achievers, moderate achievers, and high achievers. In the low 
achievers category, 56 (30%) of participants scored between 35.4% and 69.99%. In 
the moderate achievers category, 70 (37%) scored between 70.77% and 79.99%. Finally, 
in the high achievers category, 61 (33%) scored between 80.00% and 91.7%. In 
the chi-square analyses of the hypotheses, test score was grouped into low, moderate, and 
high achievers throughout the study. Of the 187 completed tests, the mean achievement 
raw score on the 48-item test was 35.7 or 74.3% with a standard deviation of 
4.63. 
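
The conversion from raw score to percent and the three-way grouping can be sketched as follows. The cut points of 70% and 80% follow the ranges given above; the function names are illustrative.

```python
N_ITEMS = 48  # number of items on the TCK

def percent_score(raw):
    """Raw score out of 48 expressed as a percentage."""
    return 100 * raw / N_ITEMS

def achievement_group(raw):
    """Low (< 70%), moderate (70% to < 80%), or high (>= 80%) achiever."""
    pct = percent_score(raw)
    if pct < 70:
        return "low"
    if pct < 80:
        return "moderate"
    return "high"

for raw in (17, 33, 34, 38, 39, 44):
    print(raw, round(percent_score(raw), 1), achievement_group(raw))
```

On a 48-item test, raw scores of 17 and 44 correspond to the reported extremes of 35.4% and 91.7%.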
TABLE 1 

Chi-square Analysis for Independent Variables and Test Score 

                                              Test Score (Percent) 
                                 Low (n=56)      Moderate (n=70)   High (n=61)        Total 
Variable                         ≤ 69.99         70.00-79.99       ≥ 80.00        number  percent 

Sex (n=187) 
  Male                           2[illegible]    40                33             96      (100) 
  Female                         [illegible]     34                33             91      (100) 
  χ²=1.083, df=2, p=0.5817 

Age (n=185) 
  (20-22)                        43              37                20             68      (100) 
  (23-27)                        25              40                30             81      (100) 
  (28-51)                        17              33                50             36      (100) 
  χ²=13.362, df=4, p=0.0096** 

Consumer related courses (n=187) 
  None                           34              38                28             144     (100) 
  1 or More                      16              30                49             43      (100) 
  χ²=8.082, df=2, p=0.0176* 

Consumer management experiences (n=187) 
  Low                            51              30                [illegible]    81      (100) 
  Moderate                       17              46                [illegible]    71      (100) 
  High                           9               27                54             35      (100) 
  χ²=33.298, df=4, p=0.0001*** 

Marital status (n=187) 
  Single                         35              33                32             125     (100) 
  Married and/or 
    previously married           19              47                34             62      (100) 
  χ²=5.681, df=2, p=0.0584 

Childrearing experiences (n=187) 
  None                           30              37                35             160     (100) 
  1 or More Children             30              [illegible]       33             27      (100) 
  χ²=0.007, df=2, p=0.9964 

Confidence/competence (n=187) 
  Low                            47              34                19             62      (100) 
  Moderate                       24              39                37             76      (100) 
  High                           18              39                43             49      (100) 
  χ²=14.704, df=4, p=0.0054** 

Willingness to teach (n=186) 
  Yes                            25              36                39             137     (100) 
  No                             45              39                16             49      (100) 
  χ²=10.386, df=2, p=0.0056** 

p<0.001*** 
p<0.01** 
p<0.05* 

As may be seen in Table 1, sex was not significantly related to test score. These results 
do not support the findings of Garman (1977) who, in his earlier national study, 
found that males scored significantly better on the achievement test than did females. 
This may be due to an increased participation of women in the labor force and more 
involvement of women in major financial decision-making. Age, as expected, was 
shown to have a definite impact on TCK performance. Those prospective teachers 
aged 28 years and over performed significantly better on the test than did the other 
prospective teachers. Respondents were asked to indicate if they had completed any 
consumer related courses. They were asked to identify the title or course number of 
any consumer related courses that they had taken at university. Of the 187 prospec- 
tive teachers, 144 had not completed any consumer related courses whereas 43 had 
completed from one to seven. 

To test the hypothesis that those prospective teachers with consumer related 
courses would score higher on the TCK than those without consumer related courses, 
the chi-square statistic was used. The variable “consumer related courses” had to be 
grouped as those without courses and those with one course or more to avoid difficul- 
ties with the cell size using chi-square analysis (Erickson & Nosanchuk, 1977). The 
test score was grouped according to the low, moderate, and high achievers categories. 
As was hypothesized, consumer related courses were significantly related to test 
score. Therefore, those subjects who had taken consumer related courses were more 
likely to achieve higher scores on the TCK than those without these courses. 
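
The p values in Table 1 can be cross-checked from the printed chi-square statistics. The following is a minimal sketch using only the Python standard library: for an even number of degrees of freedom the upper-tail probability of the chi-square distribution has a closed form, which covers the df=2 and df=4 cases reported in the table. The function name `chi2_sf_even_df` is illustrative and not part of the original analysis.

```python
import math

def chi2_sf_even_df(x, df):
    """Upper-tail probability P(X > x) for a chi-square variable with even df."""
    if df <= 0 or df % 2:
        raise ValueError("this closed form needs a positive even df")
    half = x / 2
    term, total = 1.0, 1.0
    # Sum of (x/2)^k / k! for k = 0 .. df/2 - 1
    for k in range(1, df // 2):
        term *= half / k
        total += term
    return math.exp(-half) * total

# (variable, chi-square, df, p as printed in Table 1)
reported = [
    ("Age", 13.362, 4, 0.0096),
    ("Consumer related courses", 8.082, 2, 0.0176),
    ("Marital status", 5.681, 2, 0.0584),
    ("Confidence/competence", 14.704, 4, 0.0054),
    ("Willingness to teach", 10.386, 2, 0.0056),
]
for name, stat, df, p_printed in reported:
    print(f"{name}: computed p = {chi2_sf_even_df(stat, df):.4f}, printed p = {p_printed}")
```

For the consumer management experiences row (chi-square of 33.298 with df=4), the same formula gives a probability far below the printed 0.0001, which suggests that the printed figure is a reporting floor; that reading is an assumption.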


TABLE 2 


Differences in Mean Test Scores 
of Prospective Teachers as Related to 
Consumer Management Experiences 


No. of Consumer 
Management          Mean % Score   Standard      Frequency   Percent 
Experiences         on TCK         Deviation     (n=187)     (100%) 
low (0-15)          69.86          10.53         81          43 
moderate (16-30)    76.70          [illegible]   71          38 
high (31-65)        80.30          6.1           35          19 


gamma=0.518, df=4 


Consumer management experiences such as buying a home, obtaining life insurance, 
or filing income tax were found to be highly significantly related to TCK 
score. Of those with few consumer management experiences (0-15), 51% had low 
scores on the TCK, compared with 9% of those with high numbers of experiences 
(31-65). The mean test 
score for the low experience group was 70, for the moderate group 77, and for the high 
group 80. 

The hypothesis that previous experience in consumer matters is a better predictor 
of score on the TCK than the number of consumer related courses taken was strongly 
supported. The gamma coefficient for the association between consumer related 
courses and test score was 0.395, whereas that for consumer management experiences 
and test score was 0.518. Consumer management experiences 
are thus more strongly related to score on the TCK than are consumer related courses; 
consequently, they are a better predictor of test score. 

Confidence/competence of prospective teachers to teach consumer education was 
found to be significantly related to test score. Those who indicated they had a 
high level of confidence scored significantly higher on the Test of Consumer 
Knowledge than did those who indicated they had a lower level of confidence. This 
finding supports similar findings by Davis (1978). In Davis’ study, teachers who felt 
competent or confident to teach consumer education felt better prepared to teach 
consumer education. This study sought to determine whether those prospective 
teachers who indicated they had a high level of confidence/competence also had more 
knowledge in consumer studies and would display a greater willingness to teach consumer 
education. The findings showed that this relationship was significant. 

It was hypothesized that human ecology majors would score higher on the TCK 
than students majoring in business, mathematics, science, and social studies. The 
hypothesis was based on the fact that there is a larger proportion of consumer 
courses in human ecology than in any other faculty (University of Manitoba, Calendar, 
1983-1984); consequently, human ecology students would have greater access and 
opportunity to take consumer related courses. The findings of the study showed that 
human ecology students did score the highest on the TCK (77.92%), slightly above 
science majors (77.78%). 

Garman (1977) found that the highest scoring majors on the Test of Consumer 
Competencies were, in descending order, social studies (mean percent score of 63.96), 
science (63.80), and home economics (63.60). A one-way analysis 
of variance revealed significant differences among the majors. 

In this study, neither marital status of the respondents nor their childrearing ex- 
periences were found to have any relationship with score on the Test of Consumer 
Knowledge. Both variables had collinearity problems with each other and with the 
variables of age and consumer management experiences, which may account for the 
lack of significant findings. 


Implications and Conclusion 

It is apparent from the study that consumer knowledge was influenced by age, con- 
sumer management experiences, feelings of confidence/competence, and number of 
consumer related courses taken by prospective teachers. Only 33% of the sample were 
classified as scoring high on the achievement test, and a major portion of these high 
scorers were older, had more consumer management experiences, 
had a high degree of confidence/competence, and had taken consumer related 
courses. The findings of this study reinforced the idea that knowledge about 
consumer matters is not only attainable from the classroom, but is also gained from 
life experiences. School administrators would benefit from keeping these factors in 
mind when they select teachers to teach consumer education. Also, based on the finding 
that consumer related courses were significantly related to consumer knowledge, 
it may be beneficial for school systems to provide training or workshops for teachers 
who will be teaching consumer education. 

Concurrently, the findings of the study suggest that school administrators should 
not select consumer education teachers based on gender, marital status, or 
childrearing experiences as might be intuitively expected. It was found that these var- 
iables had no bearing on the level of consumer knowledge of prospective teachers. 
This study did not find that one sex had more consumer knowledge than the other, 
that married and/or previously married subjects scored higher on the Test of
Consumer Knowledge than single subjects, or that those with the experience of raising
children had more consumer knowledge than those without children. 

Prospective teachers also need to apply the consumer education knowledge they 
possess in the classroom. Further research may attempt to assess the application of 
this knowledge by prospective teachers, to allow for more effective teacher prepara- 
tion. 
In Alberta one study partially concerning private schools was completed by a
committee appointed by the Minister of Education, and a related major study was carried out
under tendered contract. The Ontario private schools study was conducted under the 
direction of a commissioner appointed by Order in Council. Contextual differences re- 
lated to provincial constitutions and practice are reflected in the approaches taken in 
the studies. It is suggested that the continuing academic and research value of the 
Ontario study exceeds that of the Alberta studies, and that such is the consequence of 
government policies regarding the assignment of each study. The practicality of imple- 
menting the recommendations of a study is contingent on the degree to which these re- 
flect the reality of economic, social, and political forces which sway the decisions of a 
government. 


The purpose of this paper is to compare the commissioning of private school studies in 
Alberta and Ontario, the manner in which the studies were conducted, the findings 
and recommendations arising from the studies, and the usefulness of the study re- 
ports for further research and for policy making. 


Context 
Nearly all early education efforts and the establishment of the first schools in Canada 
can be credited to churches and religious societies. Early settlers arrived on this conti- 
nent even before free public education had become institutionalized in their ancestral 
homelands. Phillips (1957) pointed out that private initiative frequently performed a 
public service before local education authorities were set up. As in eastern Canada, so 
in the west the early schools were established mainly by missionaries and churches. 


Dr. Bergen is with the Department of Educational Administration specializing in the governance of education in the provinces 
and territories. 
Government Sponsored Private School Studies


Stamp (1970) explained that “State enterprise in Canadian education grew from mod- 
est beginnings in the mid-nineteenth century to gigantic proportions in the post- 
World War II era” (p. 444). 

According to Stamp (1985), the first private schools were “schools of necessity” 
which provided an education for the children of concerned parents before any state 
organized public school systems were in place. Some colonial governments provided 
some of the early voluntary schools with token grants in order to provide a measure of 
public assistance or to encourage such schools in the improvement of their instruc- 
tional programs. Some of these early private schools became “schools of privilege” 
open to those who could afford the substantial fees. 

Following the establishment of provincial school systems, other private schools,
which Stamp calls “schools of protest,” came into being. Roman Catholic and Jewish
private schools were organized because the public schools were found to be of Protes- 
tant and Christian hue. In more recent years an increasing number of private schools 
have been established by Protestant denominations which find the public schools to 
be too secular (Bergen, 1981b). The private schools of today, according to Stamp, 
might be called “schools of diversity,” which reflect not only the religious and cultural 
pluralism of current society, but also the intent of groups to strengthen and maintain 
their cultural heritage, or the desire to have their children instructed in schools which 
espouse a particular pedagogical philosophy, such as the Waldorf and Montessori 
schools. 


Ontario and Alberta: Comparisons and Contrasts 

According to the Constitution Act of 1867, one of the provincial powers relates to edu- 
cation. Section 93 states that “In and for each Province the Legislature may exclusive- 
ly make Laws in relation to Education.” Formal school systems had developed in each 
of the colonies which became the first four provinces in 1867, and the denominational 
feature of public schooling in some of these was to be protected by the constitution. 
The Federal Cabinet and Parliament were assigned powers to protect rights and privi- 
leges held by Protestants and Roman Catholics with respect to schools in any of the 
Provinces. The right of minority groups to Catholic or Protestant schools was guaran- 
teed for Ontario by virtue of section 93 of the Constitution Act of 1867 and for Alberta 
also by Section 17 of the Alberta Act of 1905. 

Prior to confederation, a series of legislative acts, culminating with the Separate 
Schools Act of 1863, provided for the placing of Catholic separate schools on an equal 
basis with those of public “common” schools in Canada West. However, financial sup- 
port was not granted beyond Grade 8 (later extended to include Grades 9 and 10) as 
the Common Schools Act of 1859 related only to the first eight grades. The matter was
tested in the courts and appealed to the Privy Council (England), which ruled in
1928 that separate school supporters had no legal claim to financial support for any
secondary schools (Foucher, 1985). In response to the support of Catholic voters at the
polls and the submissions of the Catholic Taxpayers Association of Ontario and of the 
Catholic bishops in the 1930s, the Hepburn Government sponsored an Assessment 
Bill which would improve the lot of separate school supporters. The Bill was passed in 
1936, but repealed the next year following strong opposition from public school sec- 
tors and from Protestant organizations and clergy (Walker, 1976). The issue came to a 
head again in the late 1960s, and when Education Minister William Davis became 
Premier in 1971 he announced that aid to Roman Catholic separate schools would not 
be extended (Stamp, 1982). However, the incessant lobbying for the extension of sup- 
port for secondary separate schools in Ontario culminated in the announcement by 
Premier Davis in 1984 that such funding would be introduced one grade each year
until all grades would be covered (Shapiro, 1985). In a three-to-two decision the Court 
of Appeal of the Supreme Court of Ontario supported the constitutionality of Bill 30, 
introduced by the Government on July 4, 1985, providing for the extension of public 
funding to Roman Catholic secondary schools. An appeal to the Supreme Court of 
Canada contesting the decision was heard early in 1987. On June 25, 1987, the 
Supreme Court ruled that the government financing of Ontario’s Roman Catholic 
high schools is constitutional. 

Section 17 of the 1905 Alberta Act guaranteed financial support, equal to that 
available to public schools, for separate schools for all 12 grades, and such support has 
continued without interruption. Consequently, Catholics in Alberta found little need 
to establish private schools. In Ontario, Catholic secondary education was provided in 
private schools, most of which were operating in conjunction with the separate schools 
under school boards offering elementary schooling. 

In Ontario, a notice of intention to operate a private school must be submitted an- 
nually on or before September 1st according to section 15 of the Education Act. This 
notice must include the school’s affiliation, the levels of instruction, enrollment esti- 
mates, teaching staff estimates, and information about the premises. Subsequently a 
report must be submitted listing all staff members, their teaching subjects, and their 
experience. The Minister may direct the inspection of any private school. Generally, 
all private schools are inspected by inspectors of private schools working out of the
regional offices of the Department which serve the geographical areas in which the
schools are located. Following a visit to a school, the inspector submits a detailed
report on the staff, the students, and the grades of instruction. The Ministry compiles a
list of all operating private schools. If the inspection is for the purpose of certification 
of teachers or for the granting of high school diploma credits, the Ministry imposes a 
modest service fee. Currently, no direct government grants are available to private 
schools in Ontario. However, free curriculum guidelines are provided to the schools. 

In Alberta, the operator of a private school must receive prior approval from the 
Minister of Education according to section 10 of the Department of Education Act. 
The Minister grants approval on the basis of three main criteria: the qualifications of 
the teaching staff, the adequacy of the program of instruction, and the suitability of 
the physical facilities. The Minister has the authority to cause the inspection of any 
school, whether public or private. As a matter of course, personnel from the provincial 
regional offices of education carry out such inspections in private schools, and pupils 
in such schools generally write any provincial tests which are administered in the pub- 
lic schools. Private schools are also permitted to call on the consultative resources of 
the regional offices at no cost. 

As a consequence of active lobbying by Alberta private school supporters, token 
grants of $100 per pupil were introduced in 1967 (Digout, 1969). These were gradually 
increased until a level of 75% of the per pupil grants for public schools was attained. 
This grant support constitutes approximately one-third of the total revenue per pupil 
for public schools. Generally, if the teachers hold Alberta certification, a nonprofit 
private school is entitled to receive government grants. If teachers are not certificated, 
the school is not eligible for government funding. Judge Oliver’s ruling on the 
Holdeman Mennonite private school case in 1978 supported parental choice in the 
schooling of children. The Minister’s response was to revise private school regulations 
to provide for a category of private school which did not require certificated teachers 
(Bergen, 1981a). 
In 1984, when Shapiro began his research on Ontario schools, about 535 private
schools representing about a dozen religious affiliations and several secular associa- 
tions enrolled approximately 87,100 students, of which about 40% were Catholic. 
During the same year, when Woods Gordon Management Consultants examined Al- 
berta private schools, approximately 114 approved schools with an enrollment of 
about 11,400 full-time students were tabulated; very few of these were Catholic as 
Separate Catholic schools provide for instruction in all 12 grades. 

Though some groups in Alberta have been able to conduct alternative schools 
under the general direction of public school boards, such as the Hebrew School in 
Edmonton, any such provisions are subject to the whims of the particular boards. For 
example, though Hebrew and other religiously oriented schools had been established 
as alternative schools under the Calgary public school board, a subsequently elected 
slate of school trustees terminated all such arrangements. This in turn caused some of 
the parents to turn to private schools to meet their wishes, whereas the Hebrew 
schools accepted temporary arrangements which permitted them to operate as alter- 
native schools within the Calgary Catholic Separate School District. 

The Alberta Worth Commission (1972) held that “those opting for private schools
should do so at substantial personal expense” (p. 61). The Commission further warned 
that “any wide-spread encouragement of private schools might lead to an unsustain- 
able degree of educational and social fragmentation” (p. 61). In 1950 the Ontario Hope 
Commission recommended the continued exclusion of private schools from local tax 
monies and provincial grants (Stamp, 1985). 


The Commissioning of Private School Studies 

Alberta Studies 

In response to the Government of Alberta’s concern regarding a public school 
teacher’s anti-Semitic teaching, the Premier directed the Minister of Education to 
undertake a special review of curriculum in order to ascertain whether any changes 
could be made “to foster greater tolerance and respect for minority groups in our soci- 
ety” (Ghitter, 1984, p. 7). On June 27, 1983, the Minister established the Committee 
on Tolerance and Understanding and appointed 12 members representing a broad
cross-section of Alberta citizens under the chairmanship of lawyer Ron Ghitter. A
major mandate of the Committee was to “suggest to the Minister of Education ways of 
fostering in the school system, greater tolerance and respect for human rights, funda- 
mental freedoms, and the dignity and worth of all individuals” (p. 8). The Committee 
examined aspects of public, native, intercultural, and private education. This paper 
makes reference only to the part of the Committee’s study concerning private educa- 
tion. 

Concurrently and partly in response to resolutions of the Alberta School Trustees’ 
Association that a thorough examination of private schools in the province be under- 
taken, in July 1983 the Minister of Education called for bids on a proposal for a study 
of private schools in Alberta. The proposal itself had been prepared by the Planning 
Services Branch of the Department of Education and listed more than 50 issues and 
questions to be addressed and some 30 specified activities to be conducted. Private 
consulting firms as well as several university-based educator research teams tendered 
bids; the contract was awarded to Woods Gordon Management Consultants on the 
recommendation of a Steering Committee established by the Minister. The mandate 
of the study was “to examine the development and organization of private schools in 
Alberta and to consider future directions” (Planning Services Branch, 1983, p. 7). The 
study was to include an examination of financial issues and the impact of private 
schools on public school systems and on the education of children. The 13 members of
the Steering Committee were selected from teacher, trustee, educational administrator,
and private school organizations and from the public at large. One of its members,
representing the Department of Education, served as Chairman.


The Ontario Study 

The Davis announcement regarding the funding of Catholic secondary schools in- 
cluded reference to a commission of inquiry regarding private schools. By an Order in 
Council of July 26, 1984, Dr. Bernard Shapiro, then the Director of the Ontario 
Institute for Studies in Education, was appointed sole Commissioner. His mandate 
was to “document and comment on the contribution of private schools to elementary 
and secondary education in Ontario” and included the identification of possible alter- 
native forms of governance of private schools, consideration of the desirability of pub- 
lic funding of private schools, and identifying possible relationships between private 
schools and publicly supported school boards (Shapiro, 1985, p. 78). The Order in 
Council naming Shapiro as the appointed commissioner also gave him the mandate to 
require the Minister of Education to appoint an advisory committee. The 15 members 
of the Advisory Committee included representation from boards of education, teacher 
associations, private school organizations, universities, and the Ministry of Educa- 
tion. 


Contextual Differences 

Three significant contextual differences for the two provinces are of relevance. Alber- 
ta had complete funding for Catholic separate schools for all elementary and second- 
ary grades, whereas in Ontario such funding for Catholic separate secondary grades 11 
to 13 was to be phased in beginning with September 1985. Public funding of private 
schools in Alberta had been in place since 1967, but none existed in Ontario. Finally, 
private schools in Alberta must be authorized by the Minister before they may open 
for instruction, whereas in Ontario the operators of private schools are required only 
to notify the Minister at the beginning of each school year of their intention to oper- 
ate. 


Study Procedures 
The Ghitter Committee and the Shapiro Commission had been given general man- 
dates which permitted each to examine what seemed necessary as the study pro- 
ceeded. The Woods Gordon team, on the other hand, had been given an extended and 
precise list of items to examine. 


The Ghitter Committee Study 

The Ghitter Committee formulated a set of six principles to serve as a “supporting 
structure for ... analysis and recommendations” (Ghitter, 1984, p. 16). These prin- 
ciples stated the Committee’s value positions with respect to an open democratic soci- 
ety, the obligation of parents, maximum opportunities for children, the basic mission 
of education, the public school system, and shared experiences among diverse school 
populations. Many public hearings were scheduled and numerous briefs were re- 
ceived. Only part of this activity included visiting private schools and the examination 
of curricula used in them. A discussion paper on private schools was circulated in 
order to elicit further public response. The Committee reflected on what it had heard 
and what it had read in the briefs, and also on what it had observed in the schools it 
had visited, and then formed eight conclusions which consisted of should and must 
statements. These, in turn, provided the rationale for the Committee’s six
recommendations. The status of the provincial public school system with respect to curriculum,
facilities, and teacher qualifications also served as a major criterion for the formula- 
tion of its recommendations on private schools. 

The credibility of the Ghitter Committee rested on the wide representation on the 
Committee and the expertise of the members in addressing any problems they would 
encounter. Its object was to identify any discriminatory biases and to recommend 
ways and means by which such might be reduced, eliminated, or controlled. Although 
the Committee was established in the wake of strongly biased discriminatory instruc- 
tion by a teacher in a public school, its prior attention to private schools caused some 
consternation among some private school supporters. Though it may be assumed that 
the Committee took cognizance of available research studies and that it conducted its 
discussions in the context of educational history, pedagogy, and theory, such refer- 
ences appeared neither in its first discussion paper nor in its final report. The 
Committee’s mode of operation appeared to be largely pragmatic, an assessment of 
what ought to be reasonable and acceptable at this time in Alberta education. Direct 
concern with the past or with the future is implicit rather than made explicit in its re- 
leased reports. 


The Woods Gordon Study 

The Woods Gordon team prepared six background reports and contracted out the 
preparation of four additional reports. One report prepared by Woods Gordon staff 
researchers constitutes a basic data survey on enrollments during the 1978-1983 
period. The team sent questionnaires to private schools, but fewer than one-quarter of
these were returned. Since most of the data were available in more complete form in 
the records of the Department of Education and elsewhere, this effort appeared to be 
redundant. A second report provides a survey of private school facilities. The report 
contains no reference to the manner of sampling the schools. A third report provides a 
survey of the opinions held by stakeholders, Department of Education staff, and pub- 
lic and separate school system people. However, the report does not identify which or- 
ganizations were represented by the “stakeholder” returns, nor does it define the use 
of the term. Questions solicited opinions on why parents chose to send children to pri- 
vate schools, on the right of parents to choose private schools, and on the question of 
whether private school supporters ought to have the right to direct their taxes to pri- 
vate schools. Due to the low return of questionnaires and the lack of documentation of 
the precise source of the received returns, this report has little value for subsequent 
reference. A fourth report contains a cross-country survey on grants to private 
schools, private school curricula, and the administration of private schools. Unfortun- 
ately, the report provides merely a tabulation of totals of responses, without 
indicating the status for any one province. A fifth report concerns the financial struc- 
ture of private schools in Alberta. Finally, a sixth report prepared by the Woods 
Gordon staff contains a literature review consisting of selected brief notes on articles 
on the history of private schools, their funding, enrollments and quality of education, 
and on alternative schools. A two-page summary of the literature review comprises 
about 20 brief statements, most of which begin with the phrase “some sources argue 
that.” Some significant recent publications had not been identified by the reviewers. 
A critique on the contents of the review is not provided. Furthermore, such a compila- 
tion had already been completed by university researchers and would have been avail- 
able to the Woods Gordon team. 
Of greater substantive and critical value are the reports prepared by outside
experts to whom the Woods Gordon team had contracted their preparation. A law firm
(Parlee, Irving & Associates, 1984) surveyed 44 private schools with respect to their 
legal status. It provided a review of statutes concerning private schools in the prov- 
inces, including legal opinions on the implications of section 93 of the Constitution 
Act of 1867, the Alberta Individual’s Rights Protection Act and the Canadian Charter 
of Rights and Freedoms. A well documented report was prepared by an educator-re- 
searcher (Harvey, 1984), who had made many on-site visitations in addition to a care- 
ful examination of the content of the curricula, the methods of teaching, and the 
instructional environment. This report continues to be a useful reference and can be 
used as a base study against which future developments might be measured. Another
educational researcher (Pain, 1984) carried out the testing of students in private 
schools, as well as those of a comparison public school group, at the grades three, six, 
and nine levels. The researcher also documented the attitudes of students. Though 
the report is useful in that it documents a carefully carried out testing program, the 
researcher warns that the sampling was too small and that results must be interpreted 
with caution. Meaningful overall differences were not found between private and pub- 
lic school students. A former director of a regional office of education (Andruski, 
1984) surveyed the opinions of all five current directors of regional offices with respect 
to the adequacy of the current system of approving and monitoring private schools, 
the evaluation of teachers and of instruction, and of the curricula and facilities. Since 
well documented responses were obtained from the total population of directors of re- 
gional offices, this report also can serve as a base against which developments can be 
assessed. 

The Woods Gordon team met with the appointed Steering Committee on a regular 
basis, checking its findings and recommendations with the Committee and hearing 
counsel from members of the Committee in the course of the preparation of its final 
report. 

In its main report the Woods Gordon team provides a brief statement on the con- 
text of education in Alberta, followed by seven policy considerations. These lead to 10 
key questions, six of which are should (what ought to be) questions, and four of which 
are what is (state of affairs) questions. For each of the 10 questions the report pro- 
vides a brief summary of the data or information given in the corresponding ancillary 
report, including opinions gathered during the collection of the data. This is 
succeeded by one or more conclusions, most of which take the form of belief (“we be- 
lieve that”) or indicator (“opinions indicate that”) statements. These conclusions, in 
turn, provide the rationale for one or more recommendations in response to the posed 
question. 


The Shapiro Commission Study 

In preparation for the writing of the Commission report, Shapiro visited 40 private 
schools, received 514 written briefs, held more than 20 meetings with interest groups,
consulted informally with more than 100 persons and groups, and met with his
Advisory Committee six times. Each of the background papers was prepared by
academic researchers. One paper (Stamp, 1985) gives an overview of the development of
private schools in Ontario and an analysis of the nature of these schools in the context 
of changing conditions in public education. A second paper (Finkelstein, 1985) pro- 
vides a detailed analysis of the legal and constitutional context within which private 
schools operate. A third paper (Holmes, 1985) provides an outline of various philo- 
sophical and value positions and their implications for funding, an analysis of 
alternative funding policies, and a recommendation for the implementation of a
preferred option. A fourth paper (Lawton, 1985) provides an analysis of alternative
methods of financing private schools and estimates of the cost to government of im- 
plementing each of the options. In addition, the report has appended examples of 
alternative school practices within three city public school systems. 

The appended background research papers form an invaluable part of the total
report and serve as the resource for the rationale Shapiro provides for each of his
recommendations, which take the form of policies for action.


Findings and Recommendations 

The Ghitter Committee Report 

For Alberta private schools, the Ghitter Committee recommended that the current 
level of funding at 75% of the per pupil grant to public schools be maintained, but 
that an additional 5% of funding be made available for the promotion of liaison and 
shared experiences between public and private schools. Current legislation provides 
for such shared arrangements (Sloan, 1980). The Committee recommended that 
additional funding be made available for private schools which provide special educa- 
tion programs for handicapped students. Also, it recommended that Alberta Educa- 
tion develop review procedures to ensure that all curriculum materials used by private 
schools meet the same basic standards required of public schools and in particular the 
“basic criteria of tolerance, understanding and respect for all people” (p. 114). The 
only major change from current practice was the Committee’s recommendation that 
all private schools be required to employ only certificated teachers and that there be 
only one recognized category of private schools. 


The Woods Gordon Report 

The Woods Gordon study concluded that private schools have a statutory right to 
exist in Alberta in accordance with the Canadian Charter of Rights and Freedoms and 
Alberta Bill of Rights, but that Alberta Education “has the legal power to exercise 
control” over the content of curricula, teacher qualifications, and student perform- 
ance (p. 21). The report states that although Alberta Education is not legally required 
to provide any funding to private schools, the current level of support should be main- 
tained and that consideration should be given to capital support for selected equip- 
ment and eligibility for the same types of funding grants as public schools, including 
some transportation costs. It concurs with the Ghitter Committee recommendations, 
that a class of private schools not requiring certificated teachers be eliminated, and 
that improvements of the curricula in some of the private schools were necessary. Fur- 
ther, the report holds that “home schooling sites” be allowed to continue under special 
circumstances only, such as geographic isolation, the disabilities of students, or even 
the accommodation of the religious or other convictions of parents. 


The Shapiro Commission Report 

One external survey provided the Commission with the status of private schools 
across Canada and in other countries. Three 1984 independent public opinion polls in 
Ontario showed a range of 30% to about 45% of respondents indicating a measure of 
support for the public funding of private schools (Shapiro, 1985, p. 19). This degree of 
support was relatively low compared with Alberta, where public opinion polls during 
the same year indicated that about two-thirds of all respondents favored some public 
funding of private schools (Bergen, 1986). By no means did Shapiro find consensus 
among private schools and their respective associations. Those who advocated public 
funding did so on the basis of a prior right of choice and of fairness in view of the fact
that all pay taxes in support of schooling. Some advocated public funding, but with 
reservation, in case such funding would entail some loss of autonomy. Others opposed 
public funding, valuing independence more highly than any form of public support or 
stressing the importance of the separation of church and state in education. 

The Commission set the stage for its recommendations by stating its position that, 

On moral grounds, limiting public support to Roman Catholic schools seems indefensible, for
the constitutional provisions that are usually advanced to justify the special status of such
schools serve only to describe its history. They do nothing to inform us about what we ought
to do. (p. 48)

and that,

On legal/constitutional as well as on moral grounds, the Commission believes that the special
status of the Roman Catholic schools is discriminatory. (p. 49)

The Commission’s understanding of the Constitution Act of 1867 was that section 93
“specified only which schools must be funded and not which others could not be
funded” (Shapiro, 1985, p. 49). Together with the antidiscrimination provisions of the
Canadian Charter of Rights and Freedoms, an argument could be provided for public
funding of private schools. The extension of public funding to the secondary Roman 
Catholic Schools increased the strength of the argument in the opinion of the Com- 
mission. Furthermore, the Commission held that citizens ought to have the right to 
make alternative choices and therefore not be bound by a legal public monopoly in 
education. 

A major recommendation of the Shapiro Commission states that where possible a 
private or independent school should operate as an “associated school” in agreement 
with a local school board. Such a school should receive a grant equal to the average per 
pupil operating costs of a school under a local school board multiplied by the enroll- 
ment in the associated school (1985, p. 54). Such an arrangement might include the 
sharing of programs and the provision of administrative services by the school board. 
Current Alberta legislation allows a limited form of the “association” Shapiro recom- 
mends (Sloan, 1980). The Commission report stresses that associated school arrange- 
ments be optional. The report emphasizes that in any private school satisfactory 
instruction must be provided for each student, and that all teachers be qualified, 
either by Ontario certification or deemed to be qualified by the Minister of Education. 
The report further recommends that private schools participate in some measure in 
capital and transportation grants, and receive funds for the purchase of learning materials.
Also, home schooling ought to be permitted upon notification of the director of education
of the local school board. However, diversity within the public school system
should also be encouraged, and alternative schools within school boards should be rec- 
ognized as legitimate educational enterprises. 


Discussion 
One of the major concerns of the Ghitter Committee was the curricular content
of private and public schools. Subsequently the Department of Education completed
an audit of selected private school programs (Alberta Education, 1985). According to 
the audit report, some private schools had not adequately covered the Alberta Pro- 
gram of Studies. The principles for a new school act formulated by the Policy Advi- 
sory Committee for the School Act Review (Fyfe, 1985) supported the freedom of 
parents to choose private schooling for their children. Bill 59, which received first 
reading in the Legislative Assembly on June 17, 1987, would change little with respect 
to the current status of private schools in Alberta, but would add provisions for home
education programs under the supervision of school boards. The Woods Gordon report
supported home study provisions. The Bill also makes provision for charging
school principals with the duty of ensuring that the instruction provided by teachers 
is consistent with the prescribed or approved courses of study. This specification re- 
flects the concern expressed in the Ghitter Committee report regarding curricular 
content. Following public discussion of the Bill and subsequent debate in the Assembly,
a new School Act is to be proclaimed and in force for September 1, 1988.

On June 25, 1987, the Supreme Court of Canada ruled that Ontario’s financing of 
Roman Catholic separate high schools is constitutional and that the Charter of Rights 
and Freedoms does not apply to the rights outlined in section 93 of the Constitution 
Act (1867). Following this decision, the Government of Ontario can expect powerful 
lobbying on the part of private school supporters who would welcome the implemen- 
tation of some of the recommendations of the Shapiro report, particularly with re- 
spect to government assistance in the financing of private education. 

Whereas the Alberta reports largely reflected the status quo and offered recom- 
mendations for modest adjustments and changes, the Shapiro Commission broke en- 
tirely new ground in that the philosophical and value foundations for new policy 
decisions had to be addressed. Whereas in Alberta separate school funding had in- 
cluded all grades since the establishment of the province, in Ontario the funding of 
the senior high grades of Catholic separate schools was not introduced until 1985. In 
fact, in 1985 a private school sector of Catholic education was drawn into the public 
support realm. Included in the same announcement by Davis (Shapiro, 1985) was the 
matter of extending the funding of Catholic schools and of inquiring whether public 
funding of independent schools would be desirable. It may be assumed that the in- 
quiry on private schools would not have been commissioned at this time had a deci- 
sion not been made to fund secondary Catholic schools as an integrated part of the 
separate school system. It also follows that the recommendations formulated by the 
Commission on Private Schools would relate to the changed situation in the funding 
of Catholic schools in Ontario. 

The reports for each of the two provinces addressed issues and concerns related to 
the constitutionality and legality of funding private schools, the freedom of choice in 
the context of the Canadian Charter of Rights and Freedoms, the concern that stu- 
dents should be provided with quality education, the relationships between private 
and public schools, the impact of private school funding on provincial educational 
costs, and the impact of the growth of private schools on public schools. The studies 
recognized that major decisions had to be relevant in the context of today’s pluralistic 
and multicultural society. 

An important difference between the Ghitter Committee and Woods Gordon re- 
ports and the Shapiro report is that in the Alberta reports the private schools were 
perceived as an acceptable but tangential adjunct to public schools, whereas Shapiro 
held private schools to be an integral part of the provincial education system—“like 
their public school counterparts . . . schools of Ontario” (Stamp, 1985, p. 205). Hence 
many of Shapiro’s recommendations are addressed to all the schools of Ontario, not 
just to the private schools. 


Usefulness of Reports for Research 
According to Holdaway (1986, p. 260), “Basic research can have a long-term effect on 
policy (and practice), but immediate, direct linkage should not be expected.” The pre- 
sentation of basic research was not the expectation held for any one of these govern- 
ment sponsored studies. Their usefulness for further research is therefore limited. 
Since the Ghitter Committee provided no ancillary reports, we are left with a record of
its opinions and beliefs, and its subsequent conclusions and recommendations. Little 
factual data has been recorded in its final report. 

The Woods Gordon team gathered considerable data, much of which, however, is 
not sufficiently comprehensive and lacks precise documentation. Much of the data 
gathered by Woods Gordon was already available, and some of it in better form than 
that which was collected. Duplication of effort and cost might have been avoided had 
the team included the essential expertise to identify the information which was al- 
ready at hand. One may ask why the Planning Services Branch required the contrac- 
tor to gather basic data which were readily obtainable, in part within the Department 
of Education itself. The ancillary reports on the legal context for private schools in Al- 
berta, on the curriculum, teaching methods, and instructional environment of private 
schools, and on the status quo of Alberta’s monitoring and control of private schools 
are useful. These three ancillary reports were prepared through sublet contracts. 

Two ancillary studies of particular note are included in the Shapiro Report. One 
concerns the philosophy, values, and implications for funding private schools 
(Holmes, 1985), and the other alternative methods of financing private schools 
(Lawton, 1985). Both of these would be helpful to researchers on private schools in 
other provinces. 


Relevance of Studies for Policy Making 

Governments can make policy without consulting researchers. And the governments 
of Alberta and Ontario can make policy governing private schools without directing 
committees and commissions to produce recommendations. Wirt and Mitchell (1982) 
pointed out that “the utilization of research or evaluation findings in policy formation 
is clearly optional” (p. 2). Research knowledge is “only one source for decision mak- 
ing,” and “public policy was being formulated long before there were any social sci- 
ences” (p. 2). 

In an examination of the question whether research matters to policy makers, 
Holdaway (1986, p. 258) concluded that “policy usually results from social, economic, 
and political forces and from personal sentiment rather than from research findings.” 
Apparently in none of the five provinces which have introduced funding of private 
schools did research precede government policy. In Alberta (Digout, 1969), for ex- 
ample, the lobbying of private school interest groups persuaded the Government to 
introduce such funding. As outlined by Wirt and Mitchell (1982), governments are 
moved to act when the interests or demands of constituencies become “concrete and 
identifiable,” are “transformed into issues,” and when “specific proposals for action 
are formulated” (p. 6). For example, the unrelenting demand of Roman Catholic sepa- 
rate school supporters in due time caused former Premier Davis to announce a gov- 
ernment policy in support of the extension of funding to include all high school 
grades. However, realizing that the decision would arouse the concern of non-Catholic 
private school supporters, Davis simultaneously announced the appointment of a 
commission to study the question of public funding of private schools. It appears that 
the “personal sentiment” of Davis had also changed during his term of office. His 
action might be compared to that of Trudeau who in the 1970s announced the 
Government’s policy on bilingualism and biculturalism. Anticipating the concerns of 
Canada’s many ethnic groups, he simultaneously announced a policy in support of 
multiculturalism (Spicer, 1986). However, one may speculate whether Davis would 
have made his controversial announcement had he decided to lead his party in an- 
other election. 
Coleman and LaRocque (1983, p. 247) emphasized that policy-relevant research
should be designed “to provide data and interpretations which have predictive utility 
concerning consequences of policy options.” In the words of Phillips and Phillips 
(1984) politicians are concerned “how the costs and benefits of their actions will be 
distributed among the citizenry” (p. 379), and that researchers should provide 
decision-makers “with an explicit framework which describes how the policy should 
be implemented to achieve maximum net benefits” (p. 380). Of the three studies dis- 
cussed in this paper, the Shapiro study best meets this criterion. However, the Ghitter 
and Woods Gordon studies related to major government support programs already in 
place, whereas the Shapiro study concerned recommendations for the initiation of 
such programs. Though the Ghitter and Woods Gordon reports recommend some 
additional support for private schools, it is most unlikely that during a time of reduc- 
tion in grants to education any such additional support would be instituted. Phillips 
and Phillips said that policy research cannot be done in a vacuum, but must be done 
with an awareness of the political context. All three studies satisfied this criterion. 

One might say that the Government’s decision to establish the Ghitter Committee 
was prompted by both social and political factors as a consequence of the Keegstra af- 
fair. The Woods Gordon study likely was prompted by the same factors, but for a dif- 
ferent reason. A Government policy of funding private schools was in place; however, 
questions were being raised about the adequacy of private schools, and in particular 
those not being required to hire certificated teachers. Though a previously announced 
intention of slightly increasing funds to private schools was placed on hold, the study 
provided the Government with support for its current policy, as well as for any con- 
templated changes. 

Wirt and Mitchell (1982) held that because “political actors must be concerned 
with securing and maintaining power” (p. 3), their use of research knowledge
depends on its enhancement of their political influence. For example, though the Al- 
berta Government due to budget restraints found itself compelled to reduce grants to 
education in 1987, it passed compulsory car seat-belt legislation with reluctance as an 
encumbrance on individual freedom, and only after much persuasion by the medical
professionals who claimed that such a universal seat-belt observance would result in 
substantial reduction in provincial health care costs. In view of provincial cost-cutting 
policies, the largely provincially funded internship program for newly certificated 
teachers was terminated for 1987-1988 even though the results of an evaluation 
strongly supported its continuation (Gee, 1987). Such policy decisions, in the context 
of the concurrent provincial economy, support a prediction that recommendations 
contained in the Ghitter and Woods Gordon reports which would entail additional 
public funding of private schools would not likely be implemented at this time. 

Shapiro (1985) cautioned in the introductory part of the Commission Report, that 

    the “facts” yielded by research studies can not in themselves respond to the public policy
    question of whether to provide more public funds to private schools. The response to such a
    question is not so much a matter of facts as it is a matter of values. (p. 38)

Shapiro raised the issue of values when he questioned whether it was discriminatory
to fund Roman Catholic schools without providing similar consideration to schools of 
other religious and also secular groups. This, then, appears to be the strongest crite- 
rion in the Shapiro report in support of his recommendations. The Ontario 
Government’s response to his recommendations may be motivated more by social and 
political than by economic forces. The issue of whether Ontario’s Bill 30, providing for 
the financing of Roman Catholic education to the end of high school, is discriminatory 
and contravenes the Charter of Rights and Freedoms, was placed before the Supreme 
Court of Canada in January 1987. More than a dozen groups, including the Government
of Alberta, were interested participants, each presenting a factum to the Court.
Much political interest was evident, as the Court’s decision affects not only Catholic 
and perhaps also private schools in Ontario, but was perceived to have potential im- 
pact on educational systems in other provinces. The Court has settled the Roman 
Catholic separate school issue. It might be called on to adjudicate on private school fi- 
nancing, should the Ontario Government introduce such on the basis of the Charter. 


Conclusion 

A government may decide to appoint a commission or a commissioner to do a study by 
means of an order in council. If the appointed person is adept in educational research 
and has an understanding of historical and philosophical factors and has a sensitivity 
to the prevailing political environment, and also is given the autonomy to develop the 
nature of the study, the report may be of more permanent value and impact and be of 
greater worth to other researchers. Such was the case for the Ontario Shapiro study 
and report. A government or its minister may also appoint a committee comprising a 
selected chairman and members representing various sectors of the public. The find- 
ings and recommendations of such a committee are more likely to reflect the current 
concerns of the public, and, however useful and appropriate such recommendations 
may be, these need not be rooted in intensive research or in theoretical formulations. 
Such was the case of the Alberta study of the Committee on Tolerance and Under- 
standing under the chairmanship of Ron Ghitter. A government may also choose to 
sublet a contract, and even though it calls for bids, it may have reasons for selecting a 
bid from a management firm rather than one presented by educational researchers. If 
a general management firm is selected, it is less likely that theoretical and research 
underpinnings will be major concerns in the prepared report, but rather that the cur- 
rent practicality of its recommendations will characterize the report. Such was the 
case of the study of the Woods Gordon Management Consultants. 

The current economic prospects in Ontario are brighter than those of Alberta. 
Hence the factor of costs may not weigh as heavily in the consideration of implement- 
ing any of the Shapiro recommendations. The Peterson Government might find it 
useful to gain a new mandate at the polls before it is faced with dealing with the con- 
troversial issue of private school funding. 

The policy making value of any study depends on whether its recommendations 
can be implemented while at the same time enhancing the political strength and social 
status of the Government. Nevertheless, an unanticipated major change in the eco- 
nomic and political environment can cause otherwise reasonable recommendations to 
be set aside while other pressing issues receive priority attention. The policy making 
usefulness of a study is contingent on the degree to which its recommendations reflect 
the reality of economic, social, and political forces which sway the decisions of govern- 
ments. Government policies are likely to reflect the dominant public values and cur- 
rent political milieu in greater measure than any facts established by research alone. 


References 
Alberta Education. (1985). An audit of selected private school programs. Edmonton: Author. 


Andruski, N. (1984). Results of the survey of directors, Alberta Educational Regional Offices. 
Mimeographed report. 




Bergen, J.J. (1981a). Freedom of education in a religious context: Alberta Holdeman private 
school case. Mennonite Quarterly Review, 60(1), 75-85. 

Bergen, J.J. (1981b). The private school movement in Canada. Education Canada, 21(2), 4-8. 

Bergen, J.J. (1986). An examination of private school issues in Alberta. Alberta Journal of 
Educational Research, 32(2), 91-108. 

Coleman, P., & LaRocque, L. (1983). Linking educational research and educational policy via 
policy-relevant research. Alberta Journal of Educational Research, 29(3), 242-255. 

Digout, S.L. (1969). Public aid for private schools in Alberta: The making of a decision. 
Unpublished master’s thesis, University of Alberta. 

Finkelstein, N. (1985). Legal and constitutional aspects of public funding for private schools in 
Ontario. In B.J. Shapiro (Ed.), The report of the commission on private schools in Ontario 
(pp. 81-108). Toronto: Ministry of Education. 

Foucher, P. (1985). Constitutional language rights of official language minorities in Canada. 
Ottawa: Canadian Law Information Council. 

Fyfe, M. (1985). Partners in education: Principles for a new school act. Edmonton: Alberta 
Education. 

Gee, T.W. (1987). Internship: Alberta experience. Education Canada, 27(1), 10-13. 

Ghitter, R. (1984). Committee on Tolerance and Understanding final report. Edmonton: 
Alberta Education. 

Harvey, A. (1984). Curriculum, teaching methods and the instructional environment in private 
schools. Mimeographed report. 

Holdaway, B.A. (1986). Making research matter. Alberta Journal of Educational Research, 
32(3), 249-264. 

Holmes, M. (1985). The funding of private schools in Ontario: Philosophy, values and 
implications for funding. In B.J. Shapiro (Ed.), The report of the commission on private 
schools in Ontario (pp. 109-151). Toronto: Ministry of Education. 

Lawton, S. (1985). Alternative methods of financing private schools in Ontario. In B.J. Shapiro 
(Ed.), The report of the commission on private schools in Ontario (pp. 153-191). Toronto: 
Ministry of Education. 

Pain, K. (1984). Report on the results of student testing. Mimeographed report. 

Parlee, Irving, Henning, Mustard & Rodney (Associates). (1984). Legal literature review and 
response to legal and organization surveys. Mimeographed report. 

Phillips, C.E. (1957). The development of education in Canada. Toronto: Gage. 

Phillips, K.B., & Phillips, L.T. (1984). Research, politics, and the dynamics of policy 
development: A case study of motor carrier regulatory reform. Policy Sciences, 17(4), 
367-384. 

Planning Services Branch. (1983). A proposal for a study of private schools in Alberta. 
Edmonton: Alberta Education. 

Shapiro, B.J. (Ed.). (1985). The report of the commission on private schools in Ontario. Toronto: 
Ministry of Education. 

Sloan, L.V. (1980). A policy analysis of legislation permitting public-private school agreements 
for the provision of educational services. Unpublished doctoral dissertation, University of 
Alberta. 

Spicer, K. (1986, September). Educating educators for a pluralistic society. Address to the 
Convention of the Canadian Education Association, Winnipeg. 

Stamp, R.M. (1970). Government and education in post-war Canada. In J.D. Wilson et al. (Eds.), 
Canadian education: A history (pp. 444-470). Scarborough, ON: Prentice-Hall. 

Stamp, R.M. (1982). The schools of Ontario, 1876-1976. Toronto: University of Toronto Press.

Stamp, R. (1985). A history of private schools in Ontario. In B.J. Shapiro (Ed.), The report of the
commission on private schools in Ontario (pp. 193-208). Toronto: Ministry of Education.




Walker, F.A. (1976). Catholic education and politics in Ontario (Vol. II). Toronto: Federation of
Catholic Education Associations of Ontario. (First Printing 1964) 


Wirt, F.M., & Mitchell, D.E. (1982). Social science and educational reform: The political uses of 
social research. Educational Administration Quarterly, 18(4), 1-16. 


Woods Gordon Management Consultants. (1984). A study of private schools in Alberta. 
Edmonton: Alberta Education. 


Worth, W.H. (1972). Report of the commission on educational planning. Edmonton: Queen’s 
Printer. 




Contents of Volume XXXIII, 1987


Ansu-Kyeremeh, K. Community education for the development of a rural Ghanaian village.
No. 1, March, 43-61. 

Bergen, J.J. Government sponsored private school studies in Alberta and Ontario. No. 4, 
December, 292-306. 

Brand, A.G., & House, G. Relationships between types of assignments, writer variables, and 
emotions of college composition students. No. 1, March, 21-32. 

Fagan, W.T. A comparison of the reading processes of adult illiterates and four groups of school 
age readers. No. 2, June, 123-136. 

Gee, T.W. Drafting and revising processes in grade 12 students’ examination writing. No. 2, 
June, 96-114. 

Housego, B.E.J. Critical incidents in the supervision of student teaching in an extended 
practicum. No. 4, December, 247-259. 

Jacknicke, K.J., & Rowell, P.M. Alternative orientations for educational research. No. 1, March, 
62-72. 

Johnson, N.A. The pervasive, persuasive power of perceptions. No. 3, September, 206-228. 

Kagan, D.M. Cognitive level of student teachers and their perceptions of cooperating teachers. 
No. 3, September, 180-190. 

Lytton, H., Rogers Maunula, S., & Watts, D. Moral judgments and reported moral actions: A
tenuous relationship. No. 3, September, 150-162. 

Miklos, E., & Greene, M.L. Assessments by teachers of their preservice preparation programs. 
No. 3, September, 191-205. 

Noble, E.J. Redressing the negative influence of socioeconomic status and gender in a
treatment facility school. No. 2, June, 137-143. 

Quigley, T.A., Slack, T., & Smith, G.J. Burnout in secondary school teacher coaches. No. 4,
December, 260-274. 

Rogers, W.T., & Holmes, B.J. Individually administered intelligence test scores: Equivalent or 
comparable? No. 1, March, 2-20. 

Sarros, J.C., & Friesen, D. The etiology of administrator burnout. No. 3, September, 163-179. 

Sharman, R.G. Organizational supports for implementing an instructional innovation. No. 4,
December, 236-246. 

Snart, F., Das, J.P., & Barriault, B. Information integration theory extended: Cognitive coding 
in young children. No. 4, December, 275-282. 

Steier, L.P. Computer technology and education. No. 2, June, 82-95. 

Thor, A., & Berry, R.E. A Canadian assessment of consumer education knowledge of 
prospective teachers. No. 4, December, 283-291. 

Willinsky, J.M. As some children write more than others: The impact of learning to write. No. 2, 
June, 115-122. 

Winzer, M. Mainstreaming exceptional children: Teacher attitudes and the educational 
climate. No. 1, March, 33-42. 




For Contributors 


Please prepare manuscripts in accordance with these guidelines. 


Format
• Manuscript in triplicate.
• Typewritten, double-spaced throughout.
• Abstract of approximately 100 words, on a separate sheet.
• Manuscript not to exceed 6,500 words excluding graphics.
• Author’s name and affiliation on a separate title page; title only
  on first page of manuscript to ensure anonymity in the review
  process.

Referencing
• Sources in parentheses after each reference (direct or
  otherwise), giving author’s name (unless mentioned in text),
  year of publication, and page number(s).
• Direct quotes of 40 words or less in double quotation marks
  incorporated into text; those longer than 40 words in indented
  block format.
• All sources listed alphabetically at the end of the manuscript
  under the heading References using the APA style.
• Explanatory notes, numbered consecutively and marked in the
  text with superscript numerals, may appear before the
  References under the heading Notes; any citations in notes
  follow the same format as other references.

Graphics
• Each table and figure on a separate page at the end of the
  manuscript.
• Figures, charts, and diagrams in camera-ready form.
• Number tables in Arabic numerals and center Table 1 over each
  heading.
• Number figures in Arabic numerals before the caption at foot of
  each graph, e.g., Figure 1:...
• Indicate placement of figures and tables in text, i.e., Insert
  Table 3 here.

Style
• AJER’s editorial style conforms closely to the Publication
  Manual of the American Psychological Association.
• Editorial changes may be made to manuscripts.
• For spelling, consult Webster’s New Collegiate Dictionary;
  spelling in quoted material remains as in the original.

Diskettes
• If manuscript is produced on an IBM, IBM compatible, or
  Macintosh computer, please also send diskette. Microsoft Word,
  MultiMate, WordPerfect, WordStar, Xerox Writer, XyWrite,
  and DisplayWrite 3 are acceptable.


Submission to AJER implies the manuscript is not being considered for 
publication elsewhere. 


AJER is indexed in the Alberta Education Index, Canadian Education Index, 
Current Contents/Social and Behavioral Sciences, ERIC/Current Index to Journals 
in Education, and Social Science Citation Index; appropriate articles are abstracted 
in Cognitive Development Abstracts, Educational Administration Abstracts, 
Educational Technology Abstracts, Family Abstracts, Language Behavior 
Abstracts, Multicultural Education Abstracts, Psychological Abstracts, Research 
into Higher Education Abstracts, School Organization and Management Abstracts, 
and Sociology of Education Abstracts, and reviewed by the Journal for Research in 
Mathematics Education. 


